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Glossary 

Ecosystem services Benefits provided by natural sys¬ 
tems that do not necessarily require significant 
financial maintenance, including water purifica¬ 
tion, clean air, pollination, soil decomposition, bio¬ 
diversity, and many others. In accounting, balance 
sheets often overlook these free services provided by 
Mother Nature. 

Finite resources In sustainability education, this 
involves those natural resources on planet Earth 
that require careful allocation into the future, such 
as fresh water, oil, and many others. 

NEON Acronym for a long-term environmental moni¬ 
toring program funded by Congress and mandated 
to include both research and education components, 
entitled “National Ecological Observatory Network.” 


Pedagogy The art, science, or profession of teaching 
and indirectly, the process of implementing assess¬ 
ments or outcomes as part of educational research 
in order to improve teaching. 

STEM Acronym describing the major science subjects 
for education advancement, which include “sci¬ 
ence, technology, engineering, and mathematics.” 
Sustainability When a system functions indefinitely 
with regard to present and future needs, such as 
when human and natural systems operate indefi¬ 
nitely through careful resource allocation. 

Virtual versus real In sustainability education, there is 
an underlying controversy between the importance 
of tools of teaching: newer techniques involve vir¬ 
tual tools (computers, handheld devices, simulated 
games, virtual field trips, and videos), whereas 
more conventional tools include real nature such 
as field trips, taxonomic collections, and hands-on 
natural science activities. Ideally, both virtual and 
real approaches contribute to effective learning. 

Definition of Sustainability Education 

► The best time to plant a tree is twenty years ago. The 
next best time is now. (African proverb) 

If every citizen could read the above quote and 
understand its underlying ecological concepts, eco¬ 
nomic challenges, social services, and spiritual heritage, 
then it is likely that sustainability education would be 
achieved. The notion of a tree and its ecosystem ser¬ 
vices illustrate sustainability in the simplest yet most 
robust sense. To plant and grow a tree, economists 
struggle with volatile currencies; ecologists juggle 
development and conservation; religious leaders advo¬ 
cate stewardship; and social scientists examine equity 
in a world of declining resources. Sustainability educa¬ 
tion requires an integrated approach between ecology, 
risk analyses, economics, social sciences, biological 
sciences, political sciences, languages, biotechnology, 
physical sciences, health sciences, and religion. 


Robert A. Meyers (ed.), Encyclopedia of Sustainability Science and Technology, DOI 10.1007/978-1-4419-0851-3, 
© Springer Science+Business Media, LLC 2012 
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All these practitioners (and many others) contribute 
to sustainability education, an emerging discipline that 
requires an interdisciplinary synthesis of knowledge, 
translated into practice, to insure the future of life on 
Earth. 

Introduction 

Science education, also termed STEM (science, tech¬ 
nology, engineering, and mathematics) is undergoing 
a revolution in America as well as overseas [1]. The 
clear links between our environment, human health, 
and economics have triggered a serious effort to 
improve science literacy into the twenty-first century. 
As global environmental setbacks including climate 
change, famine, and biodiversity extinction loom, 
Earth’s living systems are fast approaching a tipping 
point beyond which life on earth may cease to exist as it 
is known now [2]. Yet never before have humans had 
such high levels of technology to achieve solutions to 
global challenges. How to integrate these new technol¬ 
ogies with science education is one of the major prior¬ 
ities facing both local and national governments. 
Neighborhoods and communities face the challenges 
of air quality and fresh water; entire nations prepare to 
adapt to rising sea levels and volatile food prices due to 
droughts; and around the globe, nations struggle with 
increasing levels of carbon dioxide in our atmosphere 
and oceans [3]. All of these issues directly link to the 
quality of sustainability education, which translates 
into training the next generation to understand the 
links between a healthy environment, human health, 
a sound economy, and shared use of finite natural 
resources. 

It is daunting to link the future health of planet 
Earth with the importance of learning about how our 
Earth operates. But science or sustainability education 
in this sense has become a direct metric for our future 
survival. In America, professional groups including 
National Academy of Sciences, National Science Foun¬ 
dation, American Association for Advancement of 
Science, National Association of Environmental 
Educators, Boy and Girl Scouts of America, American 
Association of Museums, almost all scientific societies, 
and even grassroots organization such as Children and 
Nature Network, local Sierra Club chapters, and reli¬ 
gious networks have embraced the challenge of raising 


science literacy. Furthermore, science literacy is 
adopted here to umbrella many different aspects of 
STEM literacy. Diverse components of science literacy 
included under this umbrella range from marine liter¬ 
acy, environmental literacy, physics literacy, mathemat¬ 
ics literacy, and to ecological literacy. Sustainability 
goes one step further from the collective science liter¬ 
acies in that it requires the integration of both social 
and physical/biological sciences. But one common 
denominator emerges for all of these efforts relating 
to science or sustainability literacy - the notion of 
sustainability education embraces our knowledge of 
how to continue to live on planet Earth without 
exhausting our resources. 

Ultimately, sustainability education may be best 
achieved by a blend of hands-on (i.e., real) and virtual 
science education experiences. Building bridges 
between virtual and real environments, among scien¬ 
tists and citizens, and spanning ecology and economics 
were some of the central issues in recent summits and 
conferences of educators and other professionals in 
Washington, DC during 2009-2010. This chapter 
reviews some of the emerging informal science educa¬ 
tion priorities for integrating tools both virtual and 
real, ultimately to create effective sustainability 
education. 

Short History of Sustainability Education and 
Related Education Initiatives 

Over the past decade, myriad conferences, conversa¬ 
tions, workshops, and white papers have been pro¬ 
duced on the topic of science education, and on 
raising America’s STEM literacy as part of its global 
economic competitiveness [4-13]. It is well recognized 
that the conventional methods of large lectures, text¬ 
books, and short laboratory sessions are not able to 
cover the overwhelming body of scientific knowledge, 
and even worse, this format does not inspire today’s 
students to pursue careers in science or even to achieve 
a minimal level of scientific literacy. Educators and 
scientists alike recognize that innovative, new peda¬ 
gogies need implementation immediately, if America 
can regain her leadership in technology and innova¬ 
tion, especially in fields that relate to sustainability such 
as energy, conservation, biodiversity, ecosystem ser¬ 
vices, and environmental justice. This commitment 
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has gained center stage in editorials, publications, and 
conversations from the highest federal levels to the 
grassroots of small school PTAs; yet specific mecha¬ 
nisms for achieving a robust level of sustainability 
education have not yet been successfully documented. 
This remains a priority for both the current practi¬ 
tioners of STEM education, for educators, and for the 
stewards of natural resources, who recognize the need 
for interdisciplinary approaches in education to ensure 
sustainability for all. 

Ecological Literacy 

► There is one important difference between ecology 
and many other fascinating sciences and games: 
unsolved problems of chess, astronomy, or mathemat¬ 
ics will not change if we ignore them. Our activity or 
lack of activity can alter the state of ecology. (Lawrence 
B. Slobodkin 2003) 

Ecological or environmental literacy has been a 
cornerstone for both formal and informal science edu¬ 
cation [ 14] . In particular, the big challenge for ecology 
education is not a lack of information, but rather the 
need to provide the relevant context [15, 16] that will 
motivate the next generation of environmental scien¬ 
tists to collect, access, and interpret relevant informa¬ 
tion needed for ecological stewardship. A conceptual 
understanding of “healthy” ecosystems and related 
ecosystem services, ranging from food and energy to 
clean air and water, will be required - not only to serve 
as the foundation for sound economics, but also to 
sustain human health. Nature shapes, and is shaped, 
by communities where people live, and in this sense 
ecology education overlaps directly with sustainability 
education. When students bond with their natural 
surroundings, and then use virtual simulations to 
understand large-scale ecological processes and 
drivers, ecology education is achievable [17, 18]. 

During October 2010, a diverse group of organiza¬ 
tions hosted a sustainability education summit, enti¬ 
tled Environmental Literacy for a Sustainable World 
(www.esa.org/eesummit). The mission of this summit 
was not to create a white paper to mail out to partici¬ 
pants 6 months later, as is often the aftermath of con¬ 
ferences, but to create a decadal action plan embraced 
and activated by the participants to advance environ¬ 
mental literacy. One of the first notions of the summit 


was to agree that all the science literacies are closely 
related, and that the achievement of a sustainable world 
will only come about by raising the standards of eco¬ 
logical, environmental, and science literacy - or what¬ 
ever the label. An open letter to government leaders 
about the importance of sustainability education was 
one of the first outcomes (for, e.g., 17 January 2011, 
www.heraldtribune.com). Additional action items 
include: (1) Produce a common set of 5-15 key con¬ 
cepts and principles that link human and natural sys¬ 
tems for use by teachers and policy makers; (2) Engage 
communities to meet the challenges of an uncertain 
future; (3) Create a network of green-design built envi¬ 
ronments; create an “APP” (application for handheld 
mobile devices) that links personal behavior to the 
environment as an index of health; (4) Construct a 
“go-to” site for environmental materials, aka a technol¬ 
ogy infrastructure to inventory, synthesize and share 
best practices and programs; and others. These action 
items all have different timelines, with the ultimate goal 
of engaging diverse stakeholders in different elements 
of sustainability education by 2020. 

Conservation Literacy 

Like ecology, conservation biology represents another 
closely related discipline to sustainability, and hence 
the concepts and applications of an educational strat¬ 
egy will overlap. In the field of conservation biology, a 
lively debate about the creation of a conservation ethic 
started long ago with Aldo Leopold’s seminal book, The 
Sand County Almanac [14]. Leopold articulated a value 
system with regard to land, “That land is a community 
is the basic concept of ecology, but that land is to be 
loved and respected is an extension of ethics. That land 
yields a cultural harvest is a fact long known, but 
latterly often forgotten.” 

The Society for Conservation Biology defined its 
goals and objectives as, “the education, at all levels, 
preparatory and continuing, of the public, of biolo¬ 
gists, and of managers in the principles of conservation 
biology” [19]. Further, the Society’s education com¬ 
mittee articulated principles that seek to maintain 
Earth’s biological diversity, ecological integrity, and 
ecological health. But the practitioners of conservation 
biology admit that their approach is very passive, 

“Among conservation biologists the prevailing 
view of conservation education appears to rest on 
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a deep-seated belief that the basic function of conser¬ 
vation education should be to disseminate knowledge 
that scientists generate, essentially to transport infor¬ 
mation to the public and key groups in the expectation 
that it will eventually precipitate more appropriate 
conservation-related behaviors” [20]. This is not 
unusual, and most scientists have adopted a “speak 
and they will listen” approach to STEM education. 
But in a world of declining and finite resources, 
a tipping point is fast being approached in the educa¬ 
tion circles, whereby the methodologies need to dras¬ 
tically change in order to inspire a science-literate 
community as stewards of global health. Sustainability 
education - if launched with appropriate integration of 
virtual and real natural systems [21], facts and “oh- 
wow” responses, local and global perspectives, and 
interdisciplinary approaches [22, 23] - is critical to 
succeed. 

Biological Literacy 

As a biologist by training, I have tracked the notion of 
biological literacy as a component of science education 
throughout my career [ 1 ]. The world has witnessed an 
amazing explosion of knowledge in biology - ranging 
from genomics and nanotechnology to global climate 
change and how sea-level rise will impact Earth’s 
resources. 

The relevance of biology education may be the 
most focused of all the “science literacies” to date 
due to extensive support from NSF, AAAS, Associa¬ 
tion of American Medical Colleges (AAMC), and 
Howard Hughes Medical Institute (HHMI). In its 
most recent “conversation” that included over 500 
biological sciences faculty, administrators, students, 
and other practitioners, AAAS published a white 
paper called “Vision and Change” (www.visionand 
change.com) [1]. They summarized four academic 
concepts to implement change in biology education: 

(1) Integrate core concepts and competencies; 

(2) Focus on student-centered learning; (3) Promote 
a campus-wide commitment to change; and (4) Engage 
the biology community in the implementation of 
change. The outcome of the “conversation” was to 
encourage all biology educators to develop a coordi¬ 
nated and sustainable plan for implementation of 


principles that will improve the national standards of 
biology education. 

Specific working groups to advance biology educa¬ 
tion (also termed “new biology”) convened around 
different priorities needed for biology education: man¬ 
aging large, complex data sets; interdisciplinary 
approaches; making connections among different con¬ 
cepts and/or questions; and seek new inquiry-based 
learning activities rather than the conventional memo¬ 
rization and lecture-based approaches [ 1 ]. Advancing 
education in biological sciences directly impacts our 
nation’s ability to respond to recent challenges, ranging 
from climate change to safe drinking water [10-12]. 
The core literacy concepts were delineated to include 
evolution, structure and function, information flow, 
and pathways and transformations of energy/matter, 
systems [1]. Core competencies were defined as the 
abilities to apply the process of science, use quantitative 
reasoning; use modeling/simulation; tap into the 
interdisciplinary nature of science; communicate/ 
collaborate with other disciplines; and understand the 
relationship between science and society [1]. While 
these criteria were developed for biology education, 
many of the same elements would be relevant for sus¬ 
tainability education which concurrently embraces 
biology as well as other disciplines. 

Sustainability Education in the Twenty-First 
Century 

Achieving science literacy is vital in the twenty-first 
century. Sustainability education can be construed as 
that interdisciplinary platform whereby diverse mem¬ 
bers of global communities seek to understand the role 
of humans in their physical and social environments. 
As global environmental challenges loom at an unprec¬ 
edented magnitude, “tipping points” that threaten to 
irreversibly damage Earth’s living systems are fast 
approaching [16]. Yet, never before have humans had 
such a wealth of technology to achieve solutions at their 
disposal, allowing collaboration from virtually any¬ 
where in the world, drawing ideas from multiple disci¬ 
plines to process and analyze countless data points, and 
teaching the next generation to view our world in 
exciting, novel ways that inspire environmental 
stewardship. 



Earth System Environmental Literacy 


3205 



In our teaching, environmental scientists must seek 
to balance cellular versus organismal biology, virtual 
models versus real-time data, and science blended with 
policy. Future stewards will need skills in assessing, 
predicting, managing, and communicating the dra¬ 
matic ecological and societal risks and changes 
confronting us [1]. However, a major stumbling block 
in training the next generation of practitioners is the 
challenge of effectively integrating technology with in 
situ fieldwork. While most senior ecologists were 
inspired by their training in the field, younger scientists 
are more familiar with virtual ecosystems through 
gaming, social networking, and computer models, 
sometimes leading to so-called nature-deficit disorder. 
How can environmental practitioners blend “hands- 
on” fieldwork with “cutting-edge” technology? This 
conundrum is the subject of ongoing debate. Those 
born after 1980 spend more time indoors with elec¬ 
tronic devices than going outdoors to experience 
nature first hand [17, 18]. 

On a positive note, new programs are emerging that 
successfully integrate virtual and real environments. The 
National Ecological Observatory Network (NEON) will 
conduct continental-scale environmental monitoring 
in situ, and their large databases will be accessible to 
students, citizen scientists, and policy makers ([20], 
see also www.neoninc.org). At the North Carolina 
Museum of Natural Sciences in North Carolina, the 
new Nature Research Center, with its mission “to 
engage the public in understanding the scientific 
research that affects their daily lives,” will house state- 
of-the-art research laboratories, accessible to the pub¬ 
lic, a three-story Daily Planet immersion theater, 
broadcasting field science from remote sites via video- 
streaming, and dedicated virtual and real meet-the- 
scientist activities ([21], see also www.naturesearch. 
org) . These two examples illustrate the changing land¬ 
scape for ecology education, and how technology can, 
and must, advance environmental literacy. 

NEON 

The National Ecological Observatory Network will take 
field measurements by scientists, citizens, and students 
collectively to a new level [24]. This continent-scale 
research platform will conduct long-term monitoring 


in America’s major ecosystems, and create an impor¬ 
tant education outreach component as a parallel 
stream of the research program. Affectionately called 
the “ecologists’ Hubble Telescope,” this is the largest 
federally funded research project undertaken by the 
ecological community, and aspires to contribute data 
sets that will lead to best practices for solving environ¬ 
mental challenges in the future. Critical to NEON’s 
success is the sustainability education and outreach 
component that have the potential to change how 
policy makers as well as citizens learn about sustain¬ 
able solutions. 

Citizen science has burgeoned in the past decade, in 
part due to Internet and handheld technological 
devices, but also because of a growing awareness that 
sustainability education is a community responsibility. 
Almost 20 years ago, the Jason Expedition (www.jason. 
org) pioneered distance learning for middle school 
youth, who not only visited marine and rain forest 
canopy environments, but also collected ecological 
data to share with other schools (e.g., [25]). Today, 
myriad programs including Project Budburst (www. 
budburst.org) and Project Nestwatch (www. 
nestwatch.org) rely on citizens’ collecting data, which 
are collated at a central source for scientific use. 

Planning for NEON took over a decade with hun¬ 
dreds of ecologists providing input. Projected to 
become fully operational by 2016, NEON’s 20 field 
sites will be linked by high-capacity cyber infrastruc¬ 
ture with the capacity to make forecasts about environ¬ 
mental changes. Important for sustainability education, 
these issues include six of the National Academies’ grand 
challenges: biodiversity, climate change, ecohydrology, 
biogeochemistry, infectious disease, invasive species, 
and land use [26]. And even more critical to sustain¬ 
ability education is NEON’s capacity to involve not just 
scientists - but also the public, students, teachers, and 
policy makers - to use the NEON interface for 
downloading data and learning by doing. NEON also 
pledges to prioritize science communication to diverse 
audiences [24] and foster the links between human 
health and healthy ecosystems. Will NEON’s education 
vision be achieved, or will it become a pile of dusty 
papers on some university desks? Only time will tell, 
but NEON has the potential to advance sustainability 
education across an entire continent. 
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NRC 

The Nature Research Center (NRC) at the North 
Carolina Museum of Natural Sciences is a new, 
80,000-square foot wing of the existing 132-year-old 
Museum. While conventional museums focus on 
“what we know” about science, the new wing will 
engage the public in “how we know what we know” 
and in particular inspire the public to understand sci¬ 
entific research that affects their daily lives. The NRC 
goals are to conduct/communicate cutting-edge sci¬ 
ence, create exemplary role models to inspire diversity 
in the next generation of scientists, and integrate real 
and virtual science to engage youth about scientific 
discoveries. Four new labs will house scientists, educa¬ 
tors, technology experts, students, and volunteers - all 
working together to understand how science works and 
communicate it to a diverse audience. In addition, the 
notion of scientists and educators and technology geeks 
all working together on the floor is unique to the 
Museum world, and seeks to assimilate “citizen sci¬ 
ence” into all components of research and education 
at the NRC. Many partners will insure a robust content 
for science communication - joint positions with the 


UNC system, partnerships with NOAA, NASA, 
National Geographic, and North Carolina government 
agencies, will all contribute to the creation of innova¬ 
tive science communication. The NRC has four unique 
learning platforms, facilitating the assessment of how 
different audiences learn best: (1) The Daily Planet, 
a high technology, immersion theater linked through 
the Ethernet highway to all classrooms in North 
Carolina and ultimately, around the world; (2) Citizen 
science labs, where participation in data collection by 
volunteers will contribute to answering important 
questions about our environment; scientific laborato¬ 
ries with glass walls allowing visitors to be involved in 
the research; (3) Investigate labs, where related research 
will be undertaken that involves school groups or other 
visitors; and (4) Hands-on exhibits where visitors are 
immersed in how science works (Fig. 1). 

By “reinventing” the education pathways of con¬ 
ventional museums, the NRC hopes to inspire diverse 
students to become excited about science, and to 
inspire scientists to engage in successful education out¬ 
reach, which links directly to creating a higher level of 
sustainability education. 



Earth System Environmental Literacy. Figure 1 

North Carolina's new Nature Research Center integrates cutting-edge research, education and technology for diverse 
audiences to foster environmental literacy 
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Future Directions 

The future is ours by design, but there is a need to seek 
interdisciplinary solutions and then communicate 
them with one voice. When baby boomers think back 
to their childhood, they can probably recall a favorite 
tree house, scout camping trip, or neighborhood pic¬ 
nics. In today’s world, children are not allowed 
unsupervised time outdoors, due in part to parents’ 
distrust of issues ranging from strangers to global secu¬ 
rity. But knowledge of real nature is essential to raise 
levels of environmental literacy, and in turn achieve 
a higher degree of sustainability education. Harking 
back to the African proverb in the beginning of this 
entry, many children do not have a clear concept of 
what is required to sustain a tree. But knowledge of 
nature is their best weapon if young people are ulti¬ 
mately to make good decisions about personal health, 
climate change, and land-use management. 

Despite the gloom and doom notions surrounding 
environmental literacy and our human aspiration to 
attain a sustainable lifestyle, there are exemplary 
actions providing amazing examples of best practices. 
What are some of the diverse projects exemplifying 


sustainable education? Here is a selection of sustain¬ 
ability education activities, perhaps useful to imple¬ 
ment in your own community! 

1. Strategies for Ecology Education , Diversity and Sus¬ 
tainability (SEEDS). Diverse People for a Diverse 
Science. SEEDS is an education program of the 
Ecological Society of America (Fig. 2). Its mission 
is to diversify and advance the ecology profession 
through opportunities that stimulate and nurture 
the interest of underrepresented students to not 
only participate in ecology, but to lead. Focused 
mainly at the undergraduate level, with extension 
services for communities, high schools, graduate 
students, and international collaborations, the 
SEEDS program promotes an ecology profession 
with wide representation to ensure environmental 
understanding and a sustainable future for all. 
(www.esa.org/seeds) 

2. Urban Farming. In partnership with a neighboring 
community garden, Mill Creek Farm (MCF) 
transformed a vacant West Philadelphia lot into a 
thriving urban farm and intergenerational educa¬ 
tional center (Fig. 3). Now starting on their sixth 


E 



Earth System Environmental Literacy. Figure 2 

At the Ecological Society of America's annual meetings, SEEDS students are mentored to insure a diverse community of 
ecologists 
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Earth System Environmental Literacy. Figure 3 

Mill Creek Farm, near Philadelphia Pennsylvania, examplifies the growing popularity of sustainable urban farming 


field season, MCF engages and galvanizes the 
community by providing affordable and sustainable 
food to the surrounding neighborhoods, employing 
local high-school students, offering food justice 
education and much, much more. For more 
information (including a documentary and radio 
stories) visit: http://www.millcreekurbanfarm.org/ 

3. Sustainability Education for Incarcerated Youth. For 
19 years, high school science teacher DC Randle 
taught biology and environmental sciences to mid¬ 
dle and high school youth in the Minnesota correc¬ 
tion system. Defined as emotionally, behaviorally 
disordered students, they thrived under a curricu¬ 
lum that allowed them to rediscover nature. Using 
the natural resources around their facility, their 
science teacher provided a strong backbone for 
sustainability education to these at-risk youth. 

4. Local-to-Global Connections. As part of a fund 
raiser for a local canopy walkway, Meg Lowman 
started leading field trips to the Amazon for under¬ 
served youth in Florida. Called the Florida-Peru 
Connection, the local community funded scholar¬ 
ships for local Boys and Girls Club and Girls 
Inc. students. In return, young people have visited 
America from Amazonian villages along the Rio 


Napo in Peru. The link between local and global is 
an important concept for sustainability, and 
experiencing other ecosystems and culture first 
hand had a significant impact on the perspectives 
of both the Amazon and the American communi¬ 
ties who were part of the exchange. As a result of 
shared global views, both Florida and the Amazon 
share a canopy walkway for education, where chil¬ 
dren (and others) conduct research on the biodi¬ 
versity of forests (Fig. 4). More information is 
listed at www.treefoundation.og 

5. Student Outreach in Science (SOS). Environmental 
studies students at New College of Florida devel¬ 
oped sustainability outreach activities appropriate 
for elementary and middle school classes. They 
developed a skillset in science communication, 
and then took their lessons to underserved schools 
in the districts, providing a student-teaching- 
student model that was well received by both the 
teachers and the students. Units ranged from cli¬ 
mate change adaptation to growing local vegeta¬ 
bles to invasive species to clean energy. Over 
60 students at New College received the Sarasota 
County Conservation Education award based on 
their SOS program (www.canopymeg.com) 
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Earth System Environmental Literacy. Figure 4 

At North America's first public canopy walkway in Florida, third graders conduct original research on forest canopies 


6. Master of Arts in Sustainable Communities , North¬ 
ern Arizona University (http://home.nau.edu/sus/). 
Students are admitted from all kinds of back¬ 
grounds - from political science and sociology to 
science and engineering to humanities and busi¬ 
ness. Students build their own programs of 
coursework from the university’s very broad 
array of courses relevant to sustainability (of econ¬ 
omies and of societies as well as of natural 
resources and environment). More important, 
they each develop independent, community- 
based research/thesis projects. Many come to this 
program as working professionals while some are 
more conventional students coming straight from 
a bachelor’s degree. Graduates from this program 
are indeed transforming many communities 
around the intermountain west. Another unique 
program at NAU is the Professional Science Mas¬ 
ter’s in Climate Science and Solutions, http:// 
climatesciencesolutions.nau.edu/ 

7. Measuring the Sustainability of Food to Table. 
Professor David Cleveland at University of 
California, Santa Barbara engaged his environ¬ 
mental studies students in measuring the energy 
budget of regional agriculture, from farm to table. 


This agrifood systems analysis has the potential to 
change small-scale and local agriculture, and edu¬ 
cate students in sustainable practices at the same 
time. It also earned David the award of Campus 
Sustainability Champion [27]. http://es.ucsb.edu/ 
profile/cleveland 

8. Young Voices on Climate Change. Author Lynn 
Cherry, finding resistance to teaching climate sci¬ 
ence, is producing inspiring, empowering short 
films to break down those barriers. The Young 
Voices on Climate Change documents inspiring 
success stories of kids shrinking their carbon foot¬ 
print and fighting climate change. When climate 
change science is preceded with the showing of 
these films, youth and adults alike are more recep¬ 
tive to embracing the seriousness of environmen¬ 
tal change rather than denying. Young people 
listen to others their own age and, in these films 
their peers model the skills, motivation, and con¬ 
fidence necessary to reduce C0 2 emissions. (www. 
YoungVoicesonClimateChange.com) 

9. Sustainability between Connecticut and Jamaica. 
Eastern Connecticut State University recently 
had approved a new major in Sustainable Energy 
Studies - one interesting program associated with 
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students in this new major has been the service¬ 
learning project taking students to Jamaica to 
develop a small wind power system for the school 
in Lucea, Jamaica. The university has a long¬ 
standing relationship with Jamaican public 
schools because of our program to get their nor¬ 
mal school teachers to complete their bachelor’s 
degree in USA. This type of global experience in 
sustainability education is part of Eastern’s two 
Strategic Plan Initiatives - Liberal Arts Work and 
Global Citizenship. Check out: http://www. 
diversityweb.org/DiversityDemocracy/voll4no2/ 
loxsom.cfm?utm_source=pubs8uitm_medium= 
blast8mtm_campaign=divdemvoll4no2 
10. The Perennials Collaborative. The emerging gener¬ 
ation of sustainability leaders brings new ideas for 
sustainability initiatives. Samir Doshi at University 
of Vermont and colleagues have undertaken 
a collective, action-oriented community project to 
inspire “Perennials,” those visionary leaders who 
(like perennial plants) have roots in communities 
and become forces of vision and change. Doshi’s 
team is making a documentary, web site, and 
hosting community workshops throughout the 
country, especially in at-risk areas such as Appala¬ 
chia, to inspire more actions for sustainability. 
(www.perennialscollaborative.com). 

America is at a crossroads for education in science, 
technology, engineering, and mathematics (STEM). 
Currently, a company can hire nine Mexicans or eight 
Indian engineers at the same cost as one American; 
93% of middle school students learn physical sciences 
form a noncertified teacher; and Americans spend 
more on potato chips than the entire federal R&D 
budget for energy [6]. It is no wonder that America 
currently ranks 48th in terms of global STEM education. 
Recent reports by the National Academies of Science, 
Engineering and other focus groups have assessed 
America’s current STEM education, and listed actions 
that federal policy makers could take to enhance STEM 
and allow America to increase her global competitive¬ 
ness [5, 6]. Such actions include increasing America’s 
economy by vastly improving K-12 STEM education; 
doubling federal investments in STEM research to fuel 
our economy, provide security, and foster sustainabil¬ 
ity; and encourage citizen science to raise STEM 


literacy. Will these actions be implemented, or will 
the Academies’ report be overlooked? Only time will 
tell, but STEM and ultimately sustainability education 
collectively will invariably determine the quality of life 
for the next generation. 
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Glossary 

Aerodynamic resistance Or drag or aerodynamic 
drag. The component of force exerted by the 
air on a liquid or solid object that is parallel and 
opposite to the direction of flow relative to the 
object. 

Evapotranspiration A term used to describe the sum 
of evaporation and plants’ transpiration from the 
Earth’s land surface to the atmosphere. 

LAI The ratio of total upper leaf surface area of vege¬ 
tation divided by the surface area of the land on 
which the vegetation grows. LAI is a dimensionless 
value, typically ranging from 0 for bare ground to 6 
for a dense forest. 

Land surface processes The various biogeophysical 
and biogeochemical processes occurring within 
and over various land surface components and 
interacting with the atmospheric processes. 

Parameterization A method or process to approxi¬ 
mate the effect on the model resolved processes 
that would be generated by unresolved subgrid- 
scale processes using formulations containing 
empirical parameters. 

Stomatal resistance The opposition to transport of 
quantities such as water vapor and carbon dioxide 
(C0 2 ) to or from the stomata of plant leaves. 

Subgrid heterogeneity The spatial variation in land 
surface properties, for example, soil wetness, snow, 
vegetation, and terrain within an ESM grid cell, 
which is on the order of ~ 100 km. 

Tile method A method dealing with subgrid hetero¬ 
geneity. It gathers similar homogeneous land 
patches at different geographic locations within an 
ESM grid-cell into a number of “tiles” or “mosaics”; 
computes surface fluxes over each “tile” using dis¬ 
tinct parameters of each tile while sharing the same 
atmospheric forcing; and then averages the com¬ 
puted fluxes from the tiles, weighted by the frac¬ 
tional areas of each “tile.” 

Topographic index Or “wetness index,” = ln(a/tan/?), 
where a is the specific catchment area, that is, the 
upstream area above a pixel that drains through the 
unit contour at the pixel, and tan/? is the local 
surface topographic slope. A greater value corre¬ 
sponds to a pixel of lowland areas that are poten¬ 
tially wetter. 
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Definition of the Subject and Its Importance 

Land covers about 29% of the Earth’s surface and 
represents an important component of the Earth’s cli¬ 
mate system. The land surface encompasses soil, snow, 
vegetation, glaciers, urban, lakes and rivers, mountains, 
etc. Land surface processes refer to various 
biogeophysical and biogeochemical processes occur¬ 
ring within and over these components and interacting 
with the atmospheric processes. Land surface models 
(LSMs) are mathematical descriptions of the land sur¬ 
face processes, with particular attention to momentum, 
energy, and mass (water, carbon, dust, and other con¬ 
stituents) flux exchanges with its overlaying atmo¬ 
sphere; these fluxes are intimately related to their 
storages in these components. In the context of an 
earth system model (ESM), LSMs are designed to pro¬ 
vide the atmospheric model with lower boundary con¬ 
ditions over global land areas, describing the 
interactions of the land surface with the atmosphere. 
In addition, LSMs provide ocean models with dis¬ 
charges of water, sediments, and solutes. In this 
chapter, the focus is on descriptions of processes related 
to momentum, energy, water, and carbon, while pro¬ 
cesses related to other mass constituents, like dust, sed¬ 
iment, solute transports will not be included. Readers 
can also further refer to reviews of land surface modeling 
such as Sellers et al. [1], Pitman [2], and Yang [3]. 

Most LSMs, if not all, for use in ESMs and opera¬ 
tional weather forecasting models are vertically one¬ 
dimensional (ID) models solving soil temperature and 
moisture equations, while parameterizing the effects of 
lateral flows of surface and subsurface water driven by 
subgrid topography. Although three-dimensional (3D) 
hydrological models numerically describing lateral 
flows of surface and subsurface water and operating at 
a resolution of meters have existed for years, it is not 
practical to include them in a climate model due to 
their expensive computational cost. Vertically, an LSM 
extends down to a few meters of the soil (2-6 m in 
different models) and a few meters above the vegeta¬ 
tion canopy or the urban canopy. Some LSMs extend 
their depth to tens of meters deep to consider the long¬ 
term effects of permafrost and unconfined aquifers [4] . 

LSMs describe processes at various spatial scales 
ranging from millimeter to continental scales. The 
energy flux exchanges between land surface and the 


atmosphere involve processes at various scales from 
scattering of photons by plants and latent and sensible 
heat transfer by surface layer small eddies to convective 
planetary boundary layer large eddies. The water flux 
exchange involves processes at scales from water infil¬ 
tration and percolation through soil micropores, tran¬ 
spiration through plant stomata, hillslope overland 
flow to continental-scale hydrological processes. The 
subgrid, unresolvable small-scale processes are highly 
parameterized using empirical equations based on 
a basic understanding of the processes, but not 
attempting to represent the process by equations 
representing first principles. This is why LSMs are 
also referred to as land surface parameterization 
schemes. 

The land surface processes can influence weather 
and climate at timescales ranging from seconds to 
millions of years. For instance, the convective atmo¬ 
spheric boundary layer is developed in response to land 
surface heating on timescales from seconds to hours. 
Atmospheric temperature and precipitation can be 
affected through changes in soil water and groundwa¬ 
ter, seasonal variations in snow cover and frozen soil, 
and vegetation phenology on a scale ranging from days 
to seasons. Temperature and water availability can 
greatly influence vegetation structure and function 
from years to centuries [5]. The Earth’s climate has 
been tightly coupled to atmospheric C0 2 levels 
through the carbonate silicate cycle and/or the organic 
carbon cycle on geological timescales spanning hun¬ 
dreds of millions of years [6]. 

Land surface processes are critical for understand¬ 
ing climate dynamics owing to its momentum, energy, 
and mass exchanges with the overlying atmosphere. 
Land surface can “memorize” climate dynamics by 
recording and filtering the signals of weather events 
through storages of energy, water, and carbon and 
feed back to the atmosphere through surface fluxes. 
Pitman [2] reviewed coupled model experiments 
including sensitivity studies of changing albedo (e.g., 
[7]), roughness length (e.g., [8]), LAI [9], water¬ 
holding capacity [10], and roots [11], etc., deforesta¬ 
tion [12], desertification [13], land cover change (e.g., 
[14]), ecosystem feedback under doubling C0 2 [15], 
and vegetation-snow-albedo feedback [16]. These sen¬ 
sitivity experiments provided evidence that climate 
models are sensitive to the land surface processes [2]. 
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In addition, land surface feedbacks may amplify and 
prolong sea surface temperature (SST) anomaly trig¬ 
gered droughts, playing a critical role in the persistence 
of droughts and mega-droughts [17, 18]. It is well 
known that global climate dynamics is greatly affected 
by ocean-atmosphere interactions. However, land sur¬ 
face processes are of great importance for seasonal, 
interannual, and decadal climate change over specific 
land areas. Koster et al. [19] revealed that land and 
ocean processes have essentially different geographic 
regions of influence on precipitation variance, that is, 
the ocean s influence is dominant mainly in the tropical 
land, while the land’s influence is dominant over mid¬ 
latitude land areas. Koster et al. [20] identified the arid- 
to-wet transition regions over land as “hot spots” 
where changes in surface water fluxes associated with 
soil water storage have greater impacts on 
precipitation. 

The land surface is more heterogeneous than other 
Earth surface components. Within one grid-cell of an 
ESM on the order of ~100 km, different land surface 
types, for example, bare soil, snow, grass, crop, and 
forests, may coexist. An LSM has to parameterize the 
integral effects of these subgrid surfaces on surface 
energy, water, and carbon fluxes over the grid cell. 
A variety of methods are used, for example, effective 
parameters, “mosaic” (or tile) methods, probability 
density function (pdf) methods, or finer grids than 
the host climate model. LSMs are usually community 
efforts, because they are inherently interdisciplinary 
including atmospheric sciences, hydrology, ecology, 
soil and snow physics, etc., as well as social sciences 
related to land use and water use managements. 

Introduction 

LSMs have evolved for three generations [1,2] since the 
advent of the first generation global climate models 
(GCMs). The first LSM was the “bucket model,” 
which describes the land surface as a bucket with 
a fixed water holding capacity of 15 cm [21]. The 
bucket can be filled with precipitation and emptied by 
evaporation. Runoff is generated as the water that 
exceeds the bucket holding capacity, and evaporation 
is limited by the water content of the bucket. In the first 
generation models, the transfer of sensible and latent 
heat was based on simple aerodynamic bulk transfer 


formulations, using uniform and prescribed surface 
parameters, for example, water holding capacity, 
albedo, and roughness length. These models did not 
account for ecological processes (e.g., transpiration 
through plants’ stomata) and more detailed hydrolog¬ 
ical processes (e.g., infiltration and capillary upward 
flow through soil micropores). Despite their simplifi¬ 
cations, the first generation models represent a key step 
to describe land surface processes in ESMs, and their 
simulated soil moisture and evapotranspiration (ET) 
were comparable to more complex models at longer 
timescales. 

From the 1980s to early 1990s, a second generation 
of models appeared. They explicitly represented vege¬ 
tation effects and more complex soil hydrology in 
the calculation of surface energy and water balances 
[22-24]. Representative second-generation models are 
the Biosphere-Atmosphere Transfer Scheme (BATS) 
[23] and the Simple Biosphere Model (SiB) [24]. Dur¬ 
ing this period, two dozen second-generation models 
were developed with emphasis on various specific pro¬ 
cesses (e.g., [25-28]). These models vary in detail, but 
they have many common features. They usually have 
a number of soil layers, a single canopy layer (or an 
additional understory canopy layer like in SiB), and 
a bulk snow layer. Micrometeorological, hydrological, 
and ecological processes are more explicitly represented 
in these models. Instead of the fixed surface albedo in 
the first-generation models, more explicit radiation 
transfer through the vegetation canopy differentiating 
visible and near-infrared and beam and diffusive lights 
was taken into account. Turbulent transfer of heat from 
multiple surface sources, for example, canopy leaves 
and the ground surface was considered, and the 
under-canopy and leaf boundary layer turbulences 
were parameterized based on atmospheric boundary 
layer similarity theories. Transpiration through plants’ 
stomata were considered using the Jarvis-type stomatal 
conductance scheme [29], a simple empirical formula¬ 
tion as a function of light (photosynthetically active 
radiation; PAR), water (leaf water potential), and envi¬ 
ronmental conditions (temperature and humidity). 
Whether the soil moisture Richards equation is solved 
or not, vertical gravity and capillary flows of water were 
taken into account. Runoff was parameterized as 
various functions of soil water storage or excess 
above infiltration, neglecting topographic effects. 
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Accumulation and ablation of snow on the ground 
were represented with a bulk layer of snow, neglecting 
snowpack internal processes. 

Since the early 1990s, third-generation models have 
been developed. The third-generation models were 
mainly characterized by the capability of simulating 
carbon uptake by plants and vegetation dynamics, or 
“greening” of the land surface [1, 2]. Other aspects of 
the model were substantially improved at the same 
time. Some models also began to incorporate treat¬ 
ments of nutrient dynamics and biogeography, so that 
vegetation systems can change location in response 
to climate shifts. Collatz et al. [30] and Sellers et al. 
[31] began to integrate stomatal conductance 
and photosynthesis into LSMs based on the work of 
Farquhar et al. [32]. A semiempirical model of leaf 
conductance was proposed [30] based on the under¬ 
standings of the limitations on carbon assimilation by 
leaf and maximum use of water by plant [33]. 

During the same time period, LSMs were substan¬ 
tially improved in other aspects with regard to repre¬ 
sentations of hydrological processes in ESMs. LSMs 
started to account for subgrid variability in a more 
explicit way, for example, the “tile” approach to repre¬ 
sent vegetation patchiness [34, 35] and the statistical 
approach to represent subgrid distribution of soil 
moisture and runoff generation [36], spatial variability 
of infiltration [37], topographic control on subgrid soil 
moisture distribution and runoff generation [38, 39], 
and subgrid snow distributions [40, 41]. LSMs started 
to include more physically based, multilayer snow 
submodels to accommodate more internal processes 
by parameterizing growth of grain size and liquid 
water retention and percolation within snowpack 
[42-44]. Ice content within soil (frozen soil) was 
explicitly resolved as a new prognostic variable, and 
its impacts on runoff and infiltration were investigated 
[45-47]. Most recently, groundwater processes were 
implemented in LSMs [48-52], and LSMs entered 
a new era involving multiple disciplinary sciences. 

Major Processes Represented in an LSM 

Subgrid Heterogeneity and Surface Data 

Within an ESM grid on the order of ~100 km, land 
surface properties, for example, soil wetness, snow, 
vegetation, and terrain, vary at the scale of a few meters. 


Because of the strongly nonlinear dependence of 
surface fluxes on surface properties and states, for 
example, the dependence of ET on soil wetness, simple 
area-averaging approaches used to aggregate surface 
properties and states is problematic. The use of LSM 
grids whose resolution is higher than ESMs may par¬ 
tially improve the accuracy of grid cell averaged fluxes 
[53] but it is still very difficult to explicitly resolve the 
variability. Therefore, representing subgrid variability 
within ESMs will always remain a challenge no matter 
how fine the model resolution is [36, 54]. There are 
three major approaches to representing the subgrid 
heterogeneity, that is, effective parameters, “mosaic” 
or “tile” approach, and statistical approach. 

Earlier LSMs used effective parameters to obtain the 
correct fluxes averaged over patches of a grid cell. Some 
effective parameters can be obtained by simply averag¬ 
ing the parameters for various patches, e.g., albedo, 
because of the linear relationship between radiative 
fluxes and surface albedo. But other effective parame¬ 
ters, which have a nonlinear relationship with surface 
fluxes, for example, the effective roughness length, can¬ 
not be obtained by simply averaging parameter values 
of individual patches and need to be derived in such 
a way that correct area-averaged fluxes [55-57] can be 
ensured. 

The statistical approach uses pdf of input variables 
to derive grid cell averaged runoff and ET. Assuming 
point precipitation intensity and soil wetness are spa¬ 
tially distributed over a large ESM grid cell according to 
exponential and gamma distributions respectively, 
Entakhabi and Eagleson [36] derived grid cell mean 
surface runoff ratio (to grid cell mean precipitation), 
bare soil evaporation efficiency (ratio of actual to 
potential), and transpiration efficiency (ratio of actual 
to potential evaporation). Liang et al. [37] 
implemented spatially variability of infiltration capac¬ 
ity in the Variable Infiltration Capacity (VIC) model. 
Niu et al. [39] used an exponential distribution of 
topographic index (or wetness index) to parameterize 
fractional saturated area and saturation excess runoff. 

The tile approach [34, 35] gathers similar homoge¬ 
neous land patches at different geographic locations 
within an ESM grid cell into a number of “tiles” or 
“mosaics,” computes surface fluxes over each “tile” 
using distinct parameters of each tile while sharing 
the same atmospheric forcing, and then averages the 
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computed fluxes from the tiles, weighted by the frac¬ 
tional areas of each “tile .” The performance of the tile 
approach has been evaluated in numerous offline stud¬ 
ies (e.g., [35]), boundary layer or mesoscale atmo¬ 
spheric models (e.g., [34]), and coupled ESMs 
[26, 58]. The tile can be also generated using different 
variables, for example, topographic index [38], soil 
wetness [54], or soil ice content [59]. The tile approach 
may overlap too much canopy shadows and thus 
induce errors in computing net solar radiation 
absorbed by the ground, especially over high latitude, 
boreal forest regions [60]. However, this approach is 
widely used in numerous second- and third-generation 
LSMs, because of its readiness to make full use of high- 
resolution satellite remotely sensed land surface data to 
generate the “tiles.” 

As an example, the National Center for Atmo¬ 
spheric Research (NCAR) Community Land Model 
version 4.0 (CLM4) [61] adopted the tile approach 
and parameterized some key processes using the sta¬ 
tistical approach. CLM4 accounts for the subgrid het¬ 
erogeneity as a nested subgrid hierarchy of landunits, 
soil/snow columns, and plant functional types (PFTs). 
Each grid cell can have a number of different landunits, 
for example, glacier, wetland, lake, urban, natural veg¬ 
etation, and managed vegetation (crops). Each 
landunit can have a number of different columns 
(e.g., irrigated and rainfed soil), and each column can 
have multiple PFTs. CLM4 adopted the statistical 
method of Niu et al. [39] to parameterize surface sat¬ 
urated area and associated runoff. Surface datasets to 
feed CLM4 include percentages of glacier, lake, wet¬ 
land, urban, and various PFTs to determine the frac¬ 
tional area of these surface tiles. PFTs are based on 
MODIS satellite data at 30 s (~1 km) resolution [62]. 
Prescribed PFT LAI was derived from the MODIS 
satellite data of Myneni et al. [63]. Prescribed PFT 
stem area index (SAI) is derived from PFT LAI phenol¬ 
ogy combined with the method of Zeng et al. [64]. 
Prescribed canopy top and bottom heights are from 
Bonan et al. [65]. Soil color dataset (affecting surface 
albedo) is based on Lawrence and Chase [62]. The 
IGBP soil dataset (Global Soil Data Task 2000) of 
4,931 soil mapping units and their sand and clay con¬ 
tent for each soil layer is used to create a mineral soil 
texture dataset [65] and an organic matter density 
dataset [66] that vary with depth. 


Surface Energy Balance 

Over an ESM grid cell, the net radiation (R n ) absorbed 
by the land surface is balanced by latent (IF), sensible 
(H), and ground heat fluxes (G): 

R n = LE + H+G (1) 

Note that the chemical energy stored during pho¬ 
tosynthesis and released by respiration is not included 
in the above equation as it amounts to less than 1% of 
the absorbed insolation [2]. R n includes net shortwave 
and net longwave radiation fluxes. LSMs that do not 
differentiate the vegetation canopy and the ground, for 
example, Noah [67] use the above equation and com¬ 
pute a single temperature for the land surface. More 
complex models that have a separate canopy layer from 
the ground, like versions of SiB and CLM, use two 
energy balance equations, one for the vegetation can¬ 
opy and another for the ground surface, to solve their 
temperatures. The partitioning of available energy ( R n ) 
into latent and sensible heat fluxes represented by Eq. 1 
is greatly controlled by the land surface hydrological 
conditions. For an LSM using the tile subgrid 
approach, Eq. 1 is applied over each tile. 

Net Shortwave Radiation LSMs compute net short¬ 
wave radiation over various land surfaces: 

S fl = S(l~a s ) (2) 

where S is the incoming solar radiation from the host 
ESM, and a s is the surface albedo, varying with solar 
zenith angle (SZA), vegetation types, snow cover, and 
soil moisture conditions. In earlier LSMs, a s was pre¬ 
scribed for various land surface types. More complex 
second- and third-generation models used Beer’s Law 
and two-stream approximation scheme [68, 69] to 
compute radiative transfer through the vegetation can¬ 
opy, net radiation absorbed by the vegetation canopy 
and the ground, and the surface albedo. 

Beer’s law assumes that irradiance decreases expo¬ 
nentially with path length through an absorbing 
medium. Thus, the radiation reaching the ground, 
S g = Se^~ KLAI \ where k is a constant attenuation coef¬ 
ficient, and LAI is the effective leaf area index of the 
canopy. However, Beer’s law does not account for the 
scattering and multiple reflections by the canopy 
leaves [70]. 
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The two-stream approximation [68, 69] accounts 
for beam and two main streams (vertical upward and 
downward) of diffusive radiation over two wavebands 
(visible and near-infrared). However, it assumes that 
the canopy leaves are evenly distributed within a grid 
cell. On the basis of complex radiation transfer models 
[71], Yang and Friedl [72] proposed a modified two- 
stream scheme to account for aggregation of the evenly 
distributed leaves represented by a two-stream scheme 
into canopy crowns with between-canopy and within- 
canopy gaps, which vary with radius and thickness of 
the canopy, tree density (the distance between trunks), 
and SZA. The scheme needs prescribed optical param¬ 
eters, for example, reflectivity and absorptivity of the 
vegetation and ground, for various land surface types. 
The two-stream approximation also computes the frac¬ 
tion of sunlit and shaded leaves, which are crucial to 
photosynthesis processes. 

Net Longwave Radiation The net longwave radia¬ 
tion (or infrared) is computed as the residual of the 
incoming (L^j and emitted by the surface: 


L a = e{L x - oT?) (3) 

The upward longwave radiative flux emitted by the 
surface is based on the Stefan-Boltzmann Law. In the 
above equation a is the Stefan-Boltzmann constant, 8 is 
the emissivity of the land surface, and T s is the surface 
skin temperature. For a model that has a canopy layer, 
computation of L a becomes more complicated due to 
the interactions between the ground surface and its 
overlaying canopy [61]. 

Sensible and Latent Heat Fluxes The turbulent 
transfer of latent and sensible heat fluxes in most 
LSMs is based on Monin-Obukhov similarity theory 
[73, 74] . LSMs compute surface sensible and latent heat 
fluxes using the bulk transfer scheme: 


H — pC p Chu(Tq — T a ) 

( 4 ) 

LE = ^ Cwu(e o — e a ) 

y 

( 5 ) 


speed at the lowest layer of the host ESM, and T 0 and e 0 
are the temperature and water vapor pressure at the 
height of the surface roughness length, z 0 . The first- 
generation models prescribed C H and C w with different 
values for various land surfaces. The second- and third- 
generation models calculate C H and C w according to 
Monin-Obukhov similarity theory, accounting for sur¬ 
face stability conditions. For simplicity, C H = C w and is 
assumed to be equivalent to that for momentum C D in 
some LSMs. More elaborate schemes differentiate C H 
(=C W ) from C D using roughness lengths different for 
heat z 0H and momentum z 0 ; and usually z 0 > z 0H , 
because the transfer of momentum is more efficient 
(through pressure and diffusion) than those of heat 
and mass (through diffusion only). The value of 
ln(z 0 /z 0H ) is about 2.0 over a range of vegetation sur¬ 
faces, and in practical applications, z 0 /z 0H = 10 [75]. In 
analogy with Ohm’s law in electricity, the drag coeffi¬ 
cient can be transformed to an aerodynamic resistance: 

fa, h = fa : w =* (C H u) 1 (6) 

Shuttleworth [75] provided a review on formulations 
of the aerodynamic resistances for estimating surface 
ET and stability correction. 

Complex models compute sensible and latent heat 
fluxes from multiple sources of the land surface such as 
soil, snow, plants’ leaves, and stomata using equations 
similar to Eqs. 4 and 5. Over the soil surface, because of 
the strong temperature gradient between the soil skin 
(at 0 m) and z 0 as well as the difficulty in computing the 
temperature at z 0 , LSMs use the skin temperature T s to 
compute sensible and latent heat fluxes for conve¬ 
nience. The skin temperature over these surfaces can 
be computed iteratively using Eq. 1 till the surface 
energy is balanced. Because of the use of T s rather 
than T 0 in Eqs. 4 and 5, an extra resistance, r 50Z /, is 
needed [76]: 


H = pCp 


fa,H T T soil 


pCp(eXTs ) ~ fa) 

y r„ tW + r soii 


( 7 ) 

( 8 ) 


where p is air density, C p is the specific heat of dry air, y 
is the psychrometric constant, C H and C w are the tur¬ 
bulent exchange coefficients for heat and water vapor, 
respectively, T a and u are the air temperature and wind 


where e* ( T s ) is the saturated vapor pressure at T 5 , and 
r S0 u accounts for the resistance to water vapor and heat 
transfer from the surface soil pore space to z 0H (rough¬ 
ness sub-layer) through molecular diffusion. Some 
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LSMs also added a relative humidity term before e* ( T s ) 
to estimate the water vapor pressure within surface soil 
pore space. Sakaguchi and Zeng [77] reviewed numer¬ 
ous soil resistance schemes. This expression for soil 
surface evaporation is similar to that for transpiration 
through leaf stomata 


pC p (e*(T v )-e a ) 
y G,v + r c 


(9) 


where T v is the vegetation canopy temperature, r UfV is 
the leaf boundary layer resistance per unit LAI [73], 
and r c is the canopy-scale stomatal resistance that is 
scaled up from the stomatal-scale stomatal resistance, 
r sf , using LAI: r c = rJLAI. Most second-generation 
models implemented the Jarvis-type stomatal resis¬ 
tance [29], which relates r st to multiplicative stress 
factors: 


l/r* = gofs(PAR)f D ( Ac)/ T (r fl )/ W (^) (10) 

where g 0 is a vegetation type-dependent maximum 
stomatal conductance and f s , /b, /r> and /w> all °f 
which range from 0 to 1, are environmental stress 
factors associated with PAR, water vapor deficit of the 
surrounding air (Ac), air temperature (TJ, and leaf 
water potential that relates to root-zone soil moisture, 
respectively. 


Ground Heat Flux The ground heat flux is controlled 
by the soil thermal diffusion (or conduction) process 
and can be expressed as 


G = 


-A 


T]_ — T s 
Azi/2 


(ii) 


where A is the soil thermal conductivity, T x is the 
temperature of the first soil layer, and Az\ is the layer 
thickness of the first soil layer. By definition 
(G = —A(dT/dz) z=0 ), the smaller Azi, the more accu¬ 
rate G is. In case of soil covered by snow, LSMs that 
have a bulk snow layer replace AzJ2 in the above 
equation with (AzJ2 + /z snow ), where h snow is the 
snow depth. Thus, when the snowpack becomes 
thicker, G becomes more inaccurate. This is one 
of the major reasons that complex models use a multi¬ 
layer structure for the snowpack. G is then used as the 
upper boundary condition of soil temperature 
equation. 


Heat Storage in the Soil and Snow The soil thermal 
conduction (or diffusion) equation is written as 



d 

dz 


a dT ) 


( 12 ) 


where G and A are the volumetric heat capacity and 
thermal conductivity, respectively. LSMs solve the 
above equation using various methods, for example, 
the more efficient force-restore method [22, 78] as used 
in BATS and SiB and the finite difference method as 
used in the Noah and CLM4 models. 

The force-restore method is based on an analytical 
solution of the above equation under two assumptions 
that the surface forcing is periodic and the soil thermal 
properties are uniform. It is named after the two terms 
controlling surface temperature, that is, the forcing 
term (G in Eq. 11 ) and a restoring term due to heat 
diffusion from surface to a deeper soil layer. Recogniz¬ 
ing that the two assumptions are not always valid, 
Dickinson [78] took into consideration the 
nonuniform thermal properties induced by snow and 
soil moisture and generalized the force-restore method. 
However, the surface temperature solved through the 
force-restore method is not so responsive to variations 
in surface forcing with a frequency higher than diurnal 
(e.g., cloud effects on solar radiation) as the iterative 
method solving the surface energy balance equation 
(Eq. 1). Therefore, it may cause difficulties in resolving 
the available energy for snowmelt, which occurs at a 
sub-daily scale [60]. 

Current LSMs explicitly solve Eq. 12 using finite 
difference methods for layered soil and snow. The 
upper boundary condition is the surface forcing (G), 
and lower (bottom) boundary condition is either spec¬ 
ified as a zero flux as used in CLM or as the long-term 
averaged surface air temperature at a deep soil layer 
(e.g., 8 m) as used in Noah [67]. The zero flux condi¬ 
tion is more readily satisfied when the bottom layer 
is deeper. The accuracy of the solution increases as 
the number of layers increases. Due to the computa¬ 
tional cost, the number of layers cannot be very 
large. CLM has ten soil layers, while many other 
LSMs have three to five layers. To account for 
the thermal inertia of deep ground, the number of 
soil layers of CLM4 is changed from 10 to 15 layers, 
extending the total depth of the soil column down 
to 42 m [4]. 
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When covered by snow, the soil temperature is 
solved together with the temperature of snow of 
a varying number of layers [43, 60]. The layering of 
snowpack and the number of layers vary with the total 
snow depth and is more complex than layering soil 
(e.g., [43, 61]). For frozen soil, a source/sink term due 
to phase change, p ice Lfd0i ce /dt , where p ice) Lf , 6 ice , and 
t are density of ice, heat for fusion, volumetric ice 
content, and time, respectively, is added to the right- 
hand side of Eq. 12. This term accounts for the heat 
released by freezing liquid water or heat consumed to 
melt ice. It is usually combined with heat conductivity 
C and named as “apparent heat capacity,” C a , 

C.= C-„ ta L f ^ (13) 

Surface Water Balance 

The temporal variation of the water storage (S) in 
a grid cell of an ESM is controlled by the water incident 
on land surface, that is, precipitation (P) and the water 
leaving the grid cell, that is, ET and runoff ( R ): 

^ = P-ET-R (14) 

at 

The above equation neglects the lateral surface and 
subsurface flows that transport water to adjacent grid 
cells. The storage term in the above equation includes 
water (ice and liquid water) intercepted by the canopy, 
snow on the ground, water in the soil and aquifers. ET 
includes ground evaporation, canopy evaporation, and 
transpiration, which can be derived from their 
corresponding latent heat fluxes by dividing the heat 
of vaporization, L. For most LSMs, R includes surface 
runoff and subsurface runoff, which will be discussed 
later. 

Canopy-Intercepted Water Although the amount of 
the canopy-intercepted water is relatively small com¬ 
pared to other water storages, it plays an important role 
in contributing to the total ET and altering the canopy 
reflectance by intercepting snowfall. Most LSMs takes 
into consideration liquid water interception by 
the vegetation canopy, while few LSMs explicitly con¬ 
sider additional processes of loading and unloading of 
snowfall [79, 80]. The temporal variation of the liquid 
water stored on the vegetation canopy, W Cy u q , is 


determined by the rates of interception ( Ri ntr ), evapo- 
ration/dew (L c ), and drip ( Rdrip)' 

d\V c lin 

= Rintr -E c -R dnp (15) 

The interception rate is parameterized as a function 
of LAI and SAI. The interception capacity of the vege¬ 
tation canopy is also related to the subgrid distribution 
of vegetation and precipitation. Considering only 
a fraction of the vegetation canopy within the large 
area of an ESM grid cell receives precipitation, CLM4 
introduced a fractional area of precipitation as 
a limiting factor (0.25) to its interception rate [61]. 
However, this factor should vary with the distribution 
of precipitation that relates to the ratio of convective 
precipitation to total precipitation [39] and should 
be further investigated using high-resolution precipi¬ 
tation data. 

The interception capacity for snow is estimated to 
be about 50 times greater than for rain [80]. Intercep¬ 
tion of snowfall by the vegetation canopy significantly 
reduces snow mass on the forest floor. About 30-40% 
of the annual snowfall over complete coniferous cano¬ 
pies sublimates from the canopy and thus never reaches 
the ground [80]. Depending on meteorological condi¬ 
tions, the intercepted snow may fall to the ground, 
sublimate, melt, or refreeze. Implementing a snowfall 
interception model and properly representing the sub¬ 
limation of the canopy-intercepted snow can signifi¬ 
cantly improve the simulation of snow on the ground 
[81]. More importantly, properly representing inter¬ 
ception of snow can significantly improve the simula¬ 
tion of surface albedo. However, most LSMs do not 
explicitly distinguish between solid and liquid phases of 
water on the canopy surface, and they use the same 
interception capacity for both snowfall and rainfall. 

Based on the water stored on the vegetation canopy, 
a wet fraction of the canopy can be computed. Many 
LSMs (e.g., BATS and CLM) calculate the wet fraction 
as the 2/3 power of the ratio of canopy water to its 
maximum value, following Deardorff [22], while 
others (e.g., SiB) compute it as the ratio of the canopy 
water storage to its maximum value when the satura¬ 
tion vapor pressure at leaf temperature is less than the 
vapor pressure of the surrounding air. The maximum 
value of the canopy water is parameterized as 
a function of LAI. 
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Snow Water Snow cover is an important part of the 
climate system because of its high albedo, low thermal 
conductivity, and ability to absorb heat when melting. 
In addition, the snowmelt water is the primary source 
of stream flow and groundwater recharge in cold and 
alpine regions. The impacts of snow-cover processes on 
global and regional climate have been investigated by 
numerous researchers (e.g., [82-84]) using earlier 
GCMs. In addition, numerous sensitivity studies 
using GCMs found that the removal of all forests 
north of 45°N would lead to cooling and delay 
snowmelt because of the increased surface albedo 
(e.g., [12, 85]), indicating the important role of forest 
canopies in modifying the surface energy budget and 
climate over snow covered regions. 

Snow covered area exhibits spatial and temporal 
fluctuations ranging from 7% to 40% seasonally in 
the Northern Hemisphere [86]. Snow cover extent 
in the Northern Hemisphere has been decreasing 
since the mid-1980s in response to a global warming 
trend [87, 88] . On the other hand, Arctic warming may 
be accelerated by decreases in snow cover due to the 
positive snow-albedo feedback. Chapin et al. [89] 
reported that Arctic summer warming is highly related 
to an increase in snow-free days and the transition from 
tundra (short to be easily buried by snow) to forest. 

LSMs compute snowwater equivalent (SWE), snow 
depth, and fractional area covered by snow. The repre¬ 
sentation of snow processes in the first- and second- 
generation LSMs was relatively simple using a bulk 
snow layer (e.g., [26, 90, 91]). Third-generation models 
tend to have multiple snow layers to resolve the internal 
processes of snowpack such as densification, liquid 
water flow, and multiphase changes of water [42-44, 
92]. The multilayer structure also has a thin surface 
layer designed to produce a more accurate ground heat 
flux and snow skin temperature [60]. 

Bulk-layer models combine the snowpack with the 
top soil layer as a bulk layer. These models, for instance, 
BATS and SiB, use the force-restore approach to 
solve snow temperature. However, the force-restore 
approach cannot accurately resolve high-frequency 
variations in snow surface forcing and temperature, 
and thus induce errors in snowmelt. Most of the mul¬ 
tilayer snow models were simplified from schemes with 
detailed internal processes such as grain-size growth 
and gravitational flows of liquid water within a 


snowpack (e.g., [93, 94]). In a multilayer model like 
CLM4, the snow mass balance includes ice and liquid 
water content, but neglects the vapor phase. Densifica¬ 
tion processes include destructive or equi-temperature 
metamorphism, compaction due to the weight of the 
overlying layers of snow mass, and melt metamorphism, 
following Anderson [93]. Excessive liquid water above 
the holding capacity (ratio of liquid water volume to 
the free space of a snow layer) flow down to next layer. 
The melting (or freezing) energy for a layer is assessed 
as the energy excess (or deficit) needed to change the 
temperature of the layer to the freezing point [60]. 

Researchers also recognized that snowpack model¬ 
ing for hydrological applications and climate studies 
needs to account for the radiative effects of the vegeta¬ 
tion canopy (e.g., [95]) and the impacts of the inter¬ 
ception of snow by the vegetation canopy [80, 96]. In 
addition, the turbulent transfer of sensible heat flux 
below the vegetation canopy under a stable condition 
during melting season significantly affects melting 
snow on the ground [81]. However, these studies were 
not fully introduced into ESMs. 

Snow depth varies greatly at subgrid-scales because 
of heterogeneities in land cover, terrain, snow deposi¬ 
tion, snowmelt, and meteorological conditions [40]. In 
LSMs, subgrid snow distributions are represented as 
snow cover fraction (SCF), that is, the fraction of a grid 
cell covered by snow, through the relationship between 
SCF and snow depth. At an ESM grid cell scale, one of 
the largest uncertainties in modeling snow and its 
interactions with the atmosphere comes from SCF for¬ 
mulations and their associated parameters. Various 
SCF formulations as summarized in Liston [40] result 
in a wide spread of SCFs between models. ESMs vary 
significantly in simulating SCF, and most of them 
underestimate SCF [97] because of unrealistic SCF 
formulations. Most SCF formulations are parameter¬ 
ized as a function of grid cell mean snow depth and the 
ground roughness length [40, 41, 81, 90] . Some of them 
also considered the impact of subgrid topography var¬ 
iations on SCF [91]. Using gridded snow depth and 
SWE reanalysis and the advanced very high resolution 
radiometer (AVHRR) SCF [87], Niu and Yang [41] 
found the SCF-snow depth relationship varies with 
seasons and introduced snow density into an existing 
SCF formulation [90] to approximate the seasonal 
variation in the SCF-snow depth relationship. 
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Soil Water Although soil water accounts for a very 
small portion (~1%) of the terrestrial water storage, it 
plays an important role in the climate system through 
its controlling role on ET. Soil moisture anomalies 
(i.e., deviation from the mean) can persist from weeks 
to seasons [98, 99]. Numerous researchers have inves¬ 
tigated its impact on climate predictability using GCMs 
(e.g., [100]). Koster et al. [19] revealed that the land’s 
influence is dominant over midlatitude land areas. In 
analogy to the oceanic hot spot, for example, the east¬ 
ern equatorial Pacific, playing a key role in the El Nino- 
La Nina cycle, Koster et al. [20] found the terrestrial 
“hot spots,” that is, the arid-to-wet transition regions, 
where precipitation is affected by soil moisture anom¬ 
alies during boreal summer. These hot spots were fur¬ 
ther confirmed by Zeng et al. [101] using an alternative 
index accounting for the land-precipitation coupling 
strength. For this reason, a proper initialization of soil 
moisture can enhance precipitation prediction skill 
over these hot spots [102-104]. Also, upgrades of 
LSMs with more elaborate representations of soil mois¬ 
ture can improve climate prediction skill especially 
during ENSO neutral years when the SST signal is 
relatively weak [ 104] . 

Temporal variation in the total soil moisture of 
a soil column within a certain depth (e.g., 3.4 m in 
CLM4) is controlled by infiltration, that is, the residual 
of precipitation minus runoff and evaporation at the 
soil surface, root uptake through transpiration, and the 
water flux at the bottom. The vertical distribution of 
soil water is controlled by vertical root distribution and 
soil property that determine gravitational and capillary 
water fluxes. Evolving from the first-generation 
“bucket” models, which neglects the capillary water 
flux, some second- and third-generation models 
directly solve the ID, mass-based (0) Richards equation 
to calculate the vertical distribution of soil moisture 
[61, 67]. Modern LSMs also tend to increase the 
number of soil layers to improve the accuracy of 
the solution. Various model intercomparison projects 
indicated that LSMs have a much better ability to 
simulate the soil moisture anomalies than to simulate 
the absolute value of soil moisture (e.g., [105, 106]). 

The mass-based form (0) of the Richards equation 
cannot resolve the soil water distribution at both unsat¬ 
urated zone and saturated zone (groundwater) as con¬ 
tinuously as a pressure-based (\[/) Richards equation. 


However, when solved through finite-difference 
numerical scheme, it conserves mass better than the 
pressure-based form. Zeng and Decker [107] demon¬ 
strated that a previous version of CLM cannot maintain 
the hydrostatic equilibrium of soil moisture distribu¬ 
tion, and this problem cannot be resolved by increasing 
the vertical resolution of the soil column. In CLM4, 
they proposed a revised form of the Richard equation 
by explicitly subtracting the hydrostatic equilibrium 
pressure profile, i/f £ (z), from the soil water pressure 
profile, \ 

Frozen Soil Frozen soil has a tremendous impact on 
soil thermal and hydrological properties, ecosystem 
diversity and productivity, and greenhouse gas release 
from the soil. Freezing of soil water delays the winter 
cooling of the land surface; thawing of the frozen soil 
delays the summer warming of the land surface. Frozen 
soil also affects the snowmelt runoff and soil hydrology 
by reducing the soil permeability. Runoff from the 
Arctic river systems is about 50% of the net flux of 
freshwater to the Arctic Ocean [108]. This is a large 
percentage when compared to the freshwater inputs to 
the tropical oceans, where freshwater input is domi¬ 
nated by precipitation. Runoff and the degree of fresh¬ 
ening can affect ocean salinity, sea ice conditions, and 
thermohaline circulation. 

Earlier LSMs did not explicitly solve soil ice con¬ 
tent. These LSMs showed a much larger scatter in the 
simulated soil moisture and runoff in spring than in 
other seasons due to different representations of the 
effects of frozen soil on infiltration [ 109] . Some of them 
switched off infiltration for subfreezing temperatures, 
for example, in simplified SiB [25] and BATS [110]. 
This treatment failed to produce the spring peaks of 
soil moisture due to the underestimated infiltration 
[111, 112]. Xue et al. [112] improved SSiB’s ability to 
simulate the spring peaks of soil moisture by gradually 
decreasing the hydraulic conductivity at a rate of 10% 
per degree for subfreezing temperatures following 
SiB2 [113]. Pitman et al. [45] implemented an explicit 
representation of the hydrological and thermal effects 
of soil ice in their LSM but found that the representa¬ 
tion degraded runoff simulation in a large-scale 
river basin. 

Field studies showed that the effect of frozen soil on 
infiltration depends on scale, surface conditions of 
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snow and vegetation, and soil structures. Shanley and 
Chalmers [114] showed that the effects of frozen soil on 
runoff are scale-dependent. There was no significant 
correlation between seasonal runoff ratios and ground 
frost depth for the 15 year of record from the Sleepers 
River watershed, USA, with an area of 111 km 2 , while 
the increased runoff due to frozen ground was observed 
occasionally in its 0.59-km 2 agricultural subcatchment. 
Lindstrom et al. [115] also concluded that there were 
no clear effects of frozen soil on the timing and mag¬ 
nitude of runoff from an analysis of 16-year data in 
a 0.5-km 2 watershed in northern Sweden. Researchers 
have demonstrated that soil structure, air-filled poros¬ 
ity, ice content, and the number of freezing and 
thawing cycles are the governing factors affecting the 
infiltration capacity of frozen soil. Even at very local 
scales, recent laboratory and field studies using dye 
tracer techniques (e.g., [116]) revealed that water can 
infiltrate into deeper soil through preferential pathways 
where air-filled macropores exist at the time of freezing. 

One-dimensional numerical models using the 
fully coupled heat and mass balance equations [46, 
59, 117-119] used a variety of ways to parameterize 
the hydraulic properties of frozen soil. Some of the 
models [117, 119] assume that the freezing-thawing 
process is similar to the drying-wetting process with 
regard to the dependence of the soil matric potential on 
the liquid water content. This assumption leads to 
a very low infiltration rate or even upward water move¬ 
ments resulting in ice heave in surface layers. However, 
other modelers [46, 47, 59, 118] proposed various 
schemes to compute hydraulic properties as a means 
to produce greater infiltration rates. Stahli et al. [118] 
proposed two separate domains for the water infiltra¬ 
tion into frozen soil: the low-flow domain where water 
flows through the liquid water film absorbed by the soil 
particles and the high-flow domain where water flows 
through the air-filled macropores. Koren et al. [46] 
assumed that frozen soil is permeable due to soil struc¬ 
tural aggregates, cracks, dead root passages, and worm 
holes and reduced the effects of frozen soil on runoff 
production. Cherkauer and Lettenmaier [59] assumed 
that surface water tends to find areas of higher infiltra¬ 
tion capacity as it flows across a frozen surface. They 
split their model domain into 10 bins with each having 
different ice content, which was derived from the 
observed spatial distribution of soil temperature, to 


increase the infiltration rate in VIC model. Niu and 
Yang [47] introduced fractional permeable area, which 
partitions an ESM grid cell into an impermeable frac¬ 
tion and a permeable fraction, and used the total soil 
moisture (liquid water and ice) to calculate soil matric 
potential and hydraulic conductivity over the perme¬ 
able fraction. 

When soil water freezes, the water closest to soil 
particles remains in liquid form due to the absorptive 
and capillary forces exerted by soil particles. For this 
reason, there is a certain amount of liquid water 
coexisting with ice over a wide range of temperatures 
below 0°C. This supercooled soil water at subfreezing 
temperatures can be computed with the freezing point 
depression equation [46, 47] or parameterized with 
observational data [120]. Most LSMs, for example, 
CLM4, consider ice and liquid water content but neglect 
the water vapor phase. The energy available for phase 
change between ice and liquid water is assessed in the 
same way as for snow except that only the excess above 
the supercooled liquid water can be frozen in the soil. 
Li et al. [121] provided a thorough summary of frozen 
soil models with different levels of complexity. 

Groundwater Groundwater storage constitutes 
about 30% of the terrestrial water storage. Groundwa¬ 
ter storage shows as large variations as that of soil water 
at monthly or longer timescales in Illinois [122]. The 
groundwater level also shows a strong diurnal cycle in 
aquifers where the water table depth is less than 2 m 
because of the water uptake by the roots of the above¬ 
ground plants [51]. The rise and fall of the water table 
directly interacts with soil moisture in the vegetation 
root zone and thus acts as a source term for ET. 
Groundwater dynamics also control runoff generation, 
which can further affect the computation of soil mois¬ 
ture and ET in a climate model. Observational data 
show that runoff is much more related to the water 
table depth than to precipitation at monthly timescale 
in Illinois [50]. 

Earlier LSMs did not explicitly account for ground- 
water storage and its interactions with soil moisture. 
When solving the soil moisture equation, these LSMs 
assumed zero flux or free drainage at the bottom of the 
soil column as the lower boundary condition. For 
instance, an earlier version of CLM (version 3.0) used 
gravitational free drainage as its lower boundary 
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condition at a depth of 3.4 m. Water drained from the 
soil bottom under gravity during wet seasons or wet 
years (parameterized as subsurface runoff) would 
never come back to the soil column in dry seasons or 
dry years. In reality, when the soil is drying, the soil can 
draw water through capillary suction from its underly¬ 
ing aquifer (or saturated layers), which has a longer 
memory of the past precipitation events than the soil. 
LSMs implicitly include such a process when the water 
table is shallow enough to be within the model soil 
layers. However, when the water table is below the 
model’s soil bottom, LSMs neglect this upward water 
flow through capillary forces while taking into consid¬ 
eration gravitational free drainage at the model bot¬ 
tom, resulting in unrealistically dry deep soil. 

During the past decade, the need for a groundwater 
component in LSMs has received increasing attention. 
As a result, a number of researchers have incorporated 
a groundwater component into LSMs [48-52, 123]. 
These models, for use in climate studies, can be divided 
in two categories: 3D models [51, 123] and ID models 
[48-50, 52]. Because the horizontal transport of 
groundwater is more pronounced at a smaller scale, 
3D models are more applicable to regional climate 
and water resource studies, while ID models are often 
used in ESMs. 

Researchers [48-50] have proposed more realistic 
solutions to solve the continuum of soil and relatively 
shallow aquifers at the expense of adding more inte¬ 
gration elements, that is, 100 nodes, 50 layers, and 
10 layers, respectively, in the above-referenced studies. 
Even with these additional layers, the models are only 
applicable for relatively shallow aquifers (<5 m) 
because of the prescribed depth of the model. Niu 
et al. [52] added a single integration element below 
the soil of CLM and parameterized groundwater 
recharge rate according to Darcy’s law to account for 
the interactions between soil water and groundwater. 
Zeng and Decker [107] proposed another simple way 
to represent the groundwater recharge rate using the 
deviation of soil water pressure from its hydrostatic 
equilibrium at the soil bottom. All these developments 
represent a step forward from the current LSMs using 
gravitational free drainage or zero flux as their lower 
boundary condition. Further development and evalu¬ 
ation are needed to more realistically represent ground- 
water in ESMs. 


Runoff Runoff is one of the major components of the 
global water cycle and accounts for about 40% of the 
precipitation that falls on land. It plays an important 
role in the global climate system by affecting ET and 
freshwater inputs to the oceans, which in turn affects 
the ocean thermohaline circulation. LSMs’ runoff for¬ 
mulation directly affects the partitioning of precipita¬ 
tion into runoff and ET and soil moisture. 

Runoff is conceptually difficult to represent in 
ESMs. The environmental factors that control runoff, 
precipitation, soil moisture, and topography, often vary 
considerably on local scales. The plethora of conceptu¬ 
alizations for runoff schemes reflects the uncertainties 
in understanding and representing fundamental 
runoff processes. Land model intercomparison projects 
(e.g., [106,124]) summarized various implementations 
of runoff schemes ranging from simple bucket models 
to more sophisticated topography-based runoff 
models. The partitioning of precipitation into evapo- 
transpiration, surface runoff and subsurface runoff 
(baseflow) varies widely among these LSMs. Climate 
models have been adjusted so that the global, multiyear 
average runoff production is about one third of the 
average precipitation. Runoff in BATS is divided 
approximately equally between surface and subsurface 
runoff to match the early observational estimates [110]. 

Most LSMs parameterize runoff processes as a sum 
of surface runoff (fast component) and subsurface run¬ 
off or base flow (slow component). The surface runoff 
is generated by two major mechanisms: Horton runoff 
(infiltration excess) and Dunne runoff (saturation 
excess). The former is assessed as the excess of precip¬ 
itation intensity over soil infiltration capability, while 
the latter is estimated as the precipitation over satu¬ 
rated and impermeable surfaces. Horton runoff is more 
difficult to represent in ESMs because the precipitation 
intensity produced by ESMs is unrealistically small 
(“drizzle” problem) and thus precipitation needs to 
be distributed in both space and time to mimic storms. 
Hydrologists introduced the concept of fractional sat¬ 
urated area as the dominant control on surface runoff. 
Surface runoff is largely dominated by the fractional 
saturated area, that is, the saturated fraction of a grid 
cell of ESMs, where rainfall or snowmelt water imme¬ 
diately flows into local river systems [125-127]. 

The fractional saturated area is conceptually corre¬ 
lated with near-surface soil moisture as it is represented 
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in BATS [110]. More recent implementations [38, 39, 
127-129] relate the fractional saturated area to the 
water table depth (or water deficit depth) and the 
subgrid characteristics of the topography following 
TOPMODEL [126, 130]. The latter is described by 
statistics of the “topographic index” (or “wetness 
index,” A = ln(a/tan/?), where a is the specific catchment 
area, i.e., the upstream area above a pixel that drains 
through the unit contour at the pixel, and tan/? is the 
local surface topographic slope) computed from the 
high-resolution digital elevation model (DEM) [131]. 
Famiglietti and Wood [38] proposed a discretized 
framework in which the distribution of the topo¬ 
graphic index was disaggregated into a number of 
bands, each representing a fraction of the watershed 
with similar water table depth and soil moisture, to 
parameterize the subgrid variability in soil moisture 
and runoff. However, its structural conflicts with cli¬ 
mate models and its high computation costs impeded 
its application to ESMs. Most applications (e.g., [127- 
130]) used a three-parameter gamma distribution 
function to represent the pdf of the topographic 
index. By fitting the pdf of the topographic index 
with an exponential function, Niu et al. [39] simply 
expressed the saturated fraction as an exponential func¬ 
tion of the water table depth. 

The slow component was formulated as drainage 
under gravity in BATS [110] or a downslope lateral 
drainage in SiB [24]. TOPMODEL formulated its 
base flow as an exponential function of the soil water 
storage deficit [126] or the water table depth [ 130] . The 
TOPMODEL-based runoff schemes for ESMs also 
parameterized base flow as an exponential function of 
the catchment water storage deficit [128] or the water 
table depth [38, 39, 127, 129]. The exponential rela¬ 
tionship between runoff (surface runoff and base flow) 
and the water table depth was demonstrated by obser¬ 
vations [50]. 

Carbon Budgets and Vegetation Dynamics 

The terrestrial ecosystem stores very large amounts of 
carbon, which is estimated at about 500 Pg of carbon 
(1 Pg C = 10 15 g C) in plant biomass and 2,000 Pg of 
carbon in soil organic matter (top 1 m) [132]. The 
atmosphere, with a C0 2 concentration of 370 ppmv, 
contains about 785 Pg C as C0 2 [132]. Terrestrial 


ecosystem absorbs atmospheric C0 2 through photo¬ 
synthesis at a rate of about 120 Pg C per year (gross 
primary productivity; GPP) and releases about half of 
that back to the atmosphere as C0 2 by respiration, 
resulting in net primary production (NPP) at about 
60 Pg C per year. At the same time, heterotrophic 
respiration from soil and fire return about 60 Pg 
C per year back to atmospheric C0 2 , closing the loop. 
The carbon fluxes between terrestrial ecosystem and 
the atmosphere have been quite stable for millennia. 
However, losses of C from land use change have been 
steadily increasing over the last one-and-half centuries, 
approaching rates of about 2 Pg C per year, mostly 
from tropical deforestation. Net cumulative C0 2 emis¬ 
sions from land use change may be close to 200 Pg 
C during the human history [132], about 25% of the 
atmospheric C0 2 at the level of 370 ppmv. Therefore, 
ESMs simulating the evolution of climate for centuries 
of the past and the future should represent these 
processes more realistically in their LSMs. ESMs also 
need to realistically simulate the relative role of land 
versus ocean in taking up the C0 2 released by anthro¬ 
pogenic sources. 

Representing carbon budgets and vegetation 
dynamics is a major feature of the third-generation 
LSMs. The third-generation models try to fully repre¬ 
sent plant physiology and phenology. Most of these 
LSMs use a semiempirical model of leaf conductance 
[1, 30, 33] to link the stomatal resistance with photo¬ 
synthesis processes: 

1 /r st = m — — P + g a (16) 

G € S at 

where m is an empirical coefficient (~9 for most C 3 
plants and for C 4 plants; a larger m means the leaf 
consumes more water to assimilate the same carbon), 
A n is the net carbon assimilation rate, c a and e a are the 
C0 2 concentration and vapor pressure at the leaf sur¬ 
face, respectively, e sat is the saturated vapor pressure at 
the leaf temperature, Pis the atmospheric pressure, and 
g 0 is the minimum stomatal conductance. 

The net carbon assimilation is calculated as the 
minimum of three limiting factors: the Rubisco-lim- 
ited, light-limited, and export-limited rates of carbox- 
ylation rate for C 3 plants and PEP-carboxylase 
limitation for C 4 grasses based on Collatz et al. 
[30, 133]. Both Rubisco-limited and export-limited 


E 




3224 


E 


Earth System Model, Modeling the Land Component of 


rates of carboxylation are related to the maximum rate 
of carboxylation, which varies with temperature, 
foliage nitrogen, and soil water: 

^max = V max25 a^f(N)f(T v )P (17) 

where V maX 25 is maximum carboxylation rate at 25°C 
depending on plant functional types (PFTs) and 
maxis the Q 10 parameter (i.e., for an increase in tem¬ 
perature by 10°C, Vmax would increase by a vmax ), /( T v ) 
is a function that mimics thermal breakdown of meta¬ 
bolic processes [30, 32],/(AT)is a foliage nitrogen factor 
and, the /? factor is the soil moisture stress factor. 

Most third-generation models, for example, SiB2 
[113], IBIS [5], MOSES [117], and CLM4 [61] repre¬ 
sent plant physiology using the above formulations. To 
simulate the net carbon budgets from the terrestrial 
ecosystem, the models also included representations 
of autotrophic respiration and heterotrophic respira¬ 
tion. For instance, CLM4 represents autotrophic respi¬ 
ration as the sum of maintenance respiration and 
growth respiration. Maintenance respiration is 
a function of temperature and tissue nitrogen concen¬ 
tration for live biomass [ 134] , while growth respiration 
is assumed to be about 0.3 of the total carbon in new 
growth on a given time step based on construction cost 
for a range of woody and nonwoody tissues. CLM4 also 
represents decomposition, mineralization, and immo¬ 
bilization processes of carbon and nitrogen stored in 
three litter pools, three soil organic matter pools, and 
a coarse woody debris pool, under the soil moisture 
and temperature controls [134]. 

These models also describe plant phenology. For 
instance, CLM4 predicts LAI, SAI, and vegetation 
heights, including three distinct phenological types: 
evergreen, seasonal-deciduous, and stress-deciduous. 
The seasonal-deciduous phenology algorithm is based 
on the parameterizations for leaf onset and offset for 
temperate deciduous broadleaf forest [135]. Initiation 
of leaf onset is triggered when a common degree-day 
summation exceeds a critical value, and leaf litterfall is 
initiated when daylength is shorter than a critical value. 
The stress-deciduous phenology algorithm is based in 
part on the grass phenology model proposed by White 
et al. [135]. The algorithm handles phenology for veg¬ 
etation types such as grasses and tropical drought- 
deciduous trees that respond to both cold and drought 


stresses, and that can have multiple growing seasons 
per year. The overall reasonable performance of CLM4 
has been documented in Lawrence et al. [4]. 

Future Directions 

LSMs have been developed for more than 3 decades, 
focusing more on representations of biogeophysical 
processes, that is, the energy and water processes occur¬ 
ring in various terrestrial components, and are now at 
their third generation. They are becoming more com¬ 
plex and more realistically process based than earlier 
generations. The third-generation models already 
include terrestrial ecosystem dynamics and associated 
physiological effects on terrestrial carbon budgets. 
However, processes affecting greenhouse gas (GHG) 
emissions from various natural and anthropogenic 
sources over land are still not fully represented in these 
models. GHG emissions from various land surfaces are 
intimately coupled with various terrestrial biogeochem¬ 
ical processes. As suggested by Pitman [2], the need for 
an LSM to capture biogeochemical processes, which are 
tightly coupled to hydrological processes, is increasing 
and this provides focuses for future developments. 

Numerous researchers have investigated the feed¬ 
back of terrestrial ecosystem to climate change for the 
next century using the third-generation models (e.g., 
[15, 136]). Despite differences in details, these results 
demonstrate an important but uncertain role of the 
future ecosystems in the climate system. In these sim¬ 
ulations, the terrestrial ecosystem functions as a sink 
through about 2050 and then turns into a source due to 
the collapse of the soil-carbon sink, resulting in an 
acceleration of global warming (1.5 K higher than 
without the carbon feedback). However, these results 
are questionable, because the terrestrial ecosystem and 
related biogeochemical processes are more complex 
than what are currently represented in these LSMs. 
For instance, these models do not fully represent key 
processes for stabilizing soil carbon, the hysteresis rela¬ 
tionship between soil respiration and temperature, and 
the isoprene protective effects on photosynthesis under 
high temperature and light stress, etc. As an example, 
CLM4 includes the controlling role of soil temperature 
and water potential in the decomposition of leaf litter 
and soil organic carbon, but it lacks a representation of 
key processes such as biogeochemical weathering, 
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heteroaggregate formation, and microbial activities. To 
represent more sensitive links between climate, micro¬ 
bial physiology, and C transformations, the response of 
microbial enzymes and active fungi to bacteria ratios to 
climatic variation and plant cover should be included 
[137, 138]. CLM4 considered emissions of biogenic 
volatile organic carbon (BVOC) including isoprene 
and monoterpenes [139] based on the emission 
model of Guenther et al. [140]. However, CLM4 
neglects the allocation of the assimilated carbon 
through photosynthesis to BVOC emissions versus 
respiratory C0 2 emissions and C0 2 inhibitive effects 
on isoprene emissions. As recently reviewed by Vickers 
et al. [141], a rich literature exists demonstrating the 
role of isoprene in protecting photosynthesis under 
high temperature and light stresses. Therefore, third- 
generation models have recently started to implement 
photosynthesis-based isoprene emission algorithms 
[ 142] . In addition, a growing body of evidence suggests 
a temperature-respiration relationship that exhibits 
a diurnal hysteresis pattern [143]. A representative 
model (based on a Q 10 factor) that does not consider 
hysteresis would overestimate soil C0 2 efflux for the 
entire growing season [ 143] . It has also been found that 
the hysteresis is caused by the imbalance of production 
and diffusion of C0 2 within soil, and soil moisture 
controls the transition from the imbalanced hysteresis 
pattern under wet soil conditions to a balanced pattern 
without hysteresis under dry soil conditions. As such, 
Riveros-Iregui et al. [ 143] suggested that the role of soil 
moisture in controlling the hysteresis temperature- 
respiration relationship should be considered in 
models. In addition, autotrophic respiration from 
roots and more elaborate representation of vertical 
root distribution should also receive more attention. 

A major indication of the anthropogenic effects on 
the climate system is the large-scale conversion of nat¬ 
ural ecosystems to cropland throughout human his¬ 
tory. Associated with this conversion are changes in 
GHG emissions and related water use. Global agricul¬ 
ture is a major contributor to increasing the GHGs, 
with ~18% of the total anthropogenic emissions, 
including 9% of the total C0 2 emissions due to land 
use (e.g., deforestation), 40% of the total CH 4 emis¬ 
sions due to rice cultivation, and 62% of the total N 2 0 
emissions due to fertilizer application for year 2000 
[144]. For irrigation, global cropland consumes about 


2,500 km 3 of water each year, which represents almost 
70% of the global fresh water withdrawal from surface 
reservoirs and aquifers [145]. LSMs have started to 
include crop models and irrigation schemes (e.g., 
CLM4). Fully representing agricultural practices, for 
example, sowing, dates, cultivars, irrigation, and fertil¬ 
ization and associated biogeochemical processes that 
affect GHG emissions and water use represents 
a direction in future LSM development. Crop models 
developed by the agriculture community and measure¬ 
ments from the Free-air Controlled Enhancement 
(FACE) [146] over various natural and agricultural 
ecosystems should be helpful for developing models 
and test models’ response to enhanced temperature 
and C0 2 concentration. 

Almost all LSMs for use in ESMs are ID, solving the 
governing equations in the vertical direction. However, 
water flows also in lateral directions at surface and in 
subsurface driven by gravity; the lateral water flow 
controls the formation and persistence of wetlands 
and governs the dynamics of freshwater inputs to the 
oceans. There is an increasing need to implement 3D 
hydrologic model into LSMs for use in ESMs. Methane 
emission, which is the second largest contributor to the 
anthropogenic GHG, is highly related to wetland 
dynamics. Of all the natural and anthropogenic sources 
of methane, methane emission from wetlands is known 
to be the single largest [ 147] . However, most LSMs lack 
a representation of wetland dynamics, prescribing wet¬ 
land area as a fixed value. To more realistically represent 
wetland area, Fan et al. [148] developed an efficient 
hydrologic framework to simulate wetlands for climate 
models, involving large-scale groundwater convergence 
that feeds wetland. The spatial resolution of ESMs is 
increasing (^10 km) with recent computational 
advances. Implementing a quasi 3D hydrological 
model, which describes lateral surface and saturated 
subsurface (groundwater) flows and operates at 
a high spatial resolution (1 km), to LSMs will become 
possible in the near future. The hydrology community 
recently proposed hyper-resolution (1 km) global land 
surface modeling for monitoring Earth’s terrestrial 
water resources, studies of biogeochemical processes, 
and agricultural applications, etc. [149]. Lessons 
learned from developing such a hyper-resolution 
model will certainly be beneficial for developing 
advanced 3D LSMs for use in ESMs. 
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This section provides sensing and data collection 
methodologies, as well as an understanding of Earths 
climate parameters and natural and man-made phe¬ 
nomena, to support a scientific assessment of the Earth 
system as a whole, and its response to natural and 
human-induced changes. The coverage ranges from cli¬ 
mate change factors and extreme weather and fires to oil 
spill tracking and volcanic eruptions. This serves as 
a basis to enable improved prediction and response to 
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climate change, weather, and natural hazards as well as 
dissemination of the data and conclusions. The data 
collection systems include satellite remote sensing, aerial 
surveys, and land- and ocean-based monitoring stations. 
Our objective in this treatise is to provide a significant 
portion of the scientific and engineering basis of Earth 
system monitoring and to provide this in 17 detailed 
articles or chapters written at a level for use by university 
students through practicing professionals. The reader is 
also directed to the closely related sections on ► Ecolog¬ 
ical Systems, Introduction and also ► Climate Change 
Modeling Methodology, Introduction as well as ► Cli¬ 
mate Change Remediation, Introduction to. For ease of 
use by students, each article begins with a glossary of 
terms, while at an average length of 25 print pages 
each, sufficient detail is presented for use by profes¬ 
sionals in government, universities, and industries. The 
chapters are individually summarized below. 

► Aircraft and Space Atmospheric Measurements 
Using Differential Absorption Lidar (DIAL) - 
A particular lidar technique called differential 
absorption lidar (DIAL) has proven to be an especially 
powerful method to measure atmospheric chemical 
species. It has been used both on the ground and in 
many aircraft to determine the concentration of gasses. 
DIAL systems can be deployed in fixed ground loca¬ 
tions, in either piloted or unpiloted aircraft or in satel¬ 
lite platforms. 

► Coral Reef Ecosystems - Over the millennia, coral 
reef ecosystems have served as the basis of human sur¬ 
vival and sustenance in many coastal areas in the tropics 
and subtropics. They are significant geologic formations 
that provide crucial protection from destructive forces of 
the sea. They also harbor the greatest diversity of animal 
phyla of any other habitat on the planet. Threats due to 
climate change are detailed. 

► Airborne and Space-borne Remote Sensing of 
Cryosphere - The cryosphere broadly constitutes all 
the components of the Earth system, which contain 
water in a frozen state. As such, glaciers, ice sheets, 
snow cover, lake and river ice, sea ice, and permafrost 
make up the terrestrial oceanic elements of the 
cryosphere. While ice particles in the upper atmo¬ 
sphere and icy precipitation near the surface are the 
representative members of the Cryosphere in atmo¬ 
spheric systems. Airborne and spaceborne remote sens¬ 
ing technologies are presented. 


► Heat Content and Temperature of the Ocean - 
Following years of research and development in 
autonomous float technology, major international 
cooperation under the Argo Project led to the seeding 
of the global ocean with more than 3,000 floats each 
with the capability of profiling temperature and other 
properties between depths of 2,000 m and the surface 
approximately every 10 days. Measurements are relayed 
by satellite while the floats are at the surface in between 
each cycle of ascent and descent. Coverage is almost 
global. Measuring variability in sea level using satellite 
altimetry in situ temperature observations are comple¬ 
mentary in efforts to infer changes in heat content. 

► Hurricane and Monsoon Tracking with 
Driftsondes - During the international African 
Monsoon Multidisciplinary Analysis (AMMA) project, 
high flying balloons (called driftsondes) dropping 
meteorological sondes to take measurements over 
remote, but scientifically important locations, were 
deployed over West Africa and tropical Atlantic to 
study weak weather systems, called African easterly 
waves, that serve as seedlings for hurricanes. 

► Large-Scale Ocean Circulation: Deep Circulation 
and Meridional Overturning - Observations from long¬ 
term moorings and deep floats have illustrated a 
dynamic deep ocean environment, with energetic vari¬ 
ability observed at a wide range of space and time scales, 
including narrow jets, eddies, and recirculating gyres. 

► Long-Term Ecological Research Network - The 
US Long Term Ecological Research (LTER) Network is 
the largest and longest-lived ecological network in the 
continental United States. Designed to provide long¬ 
term data from a broad range of key ecosystems, the 
LTER Network represents a unique national resource 
for addressing pressing environmental issues such as 
climate change, loss of biodiversity, and changes in 
patterns of land use. 

► Ocean Acidification - The oceans have absorbed 
about one third of the anthropogenic carbon emis¬ 
sions, which in part ameliorates global warming 
induced by human activity. However, this capacity to 
absorb C0 2 is challenged by the accelerated increase in 
human activities including the increasing burning of 
fossil fuels, cement production, agriculture, and defor¬ 
estation. The accelerated rate of increase in the concen¬ 
tration of atmospheric C0 2 and the parallel rise in 
dissolved C0 2 in seawater (and the associated chemical 
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changes) can affect marine life. For example, increases 
in C0 2 can potentially disrupt the acid base balance in 
tissues, fluids, and cells of marine organisms. 

► Ocean Evaporation and Precipitation - Taken as 
a whole, the hydrological cycle is characterized by the 
evaporation of about half a million cubic kilometers of 
water per year, the bulk of which (86%) is from the 
ocean, with only 14% originating in the continents. 
The vast majority of the water that evaporates from 
the oceans (90%) is precipitated back into them, while 
the remaining 10% is transported to the continents, 
where it precipitates. About two thirds of this precipi¬ 
tation is recycled over the continents and only one third 
runs off directly into the oceans. This chapter covers 
the distribution of evaporation from the ocean and 
precipitation, oceanic heat reservoirs and evaporation, 
oceanic evaporation and monsoonal precipitation 
regimes, and the response of the relationship between 
evaporation and precipitation. 

► Ocean Observatories and Information: Building 
a Global Ocean Observing Network - Ocean observa¬ 
tories are collections of networks of sensors that are 
deployed to sample the ocean physics, chemistry, and 
biology. The goal of these networks is to overcome 
chronic under-sampling of the oceans by providing 
sustained measurements in space and time. The data 
collected by these networks are used to address a range 
of basic and applied research questions, hindered by 
a lack of data. This chapter covers the need for a global 
ocean observing network, design considerations, 
platforms, and information systems. 

► Oil Spill Remote Sensing - Remote sensing for oil 
spills is reviewed. The technical aspects of sensors are 
considered and the benefits and limitations of each sen¬ 
sor are assessed. Oil spill response often requires that 
remote sensing be used to detect, map and track the spill 
of interest. A wide variety of technologies are covered. 

► The Gravity Recovery and Climate Experiment 
(GRACE) Detection of Ice Mass Loss, Terrestrial Mass 
Changes, and Ocean Mass Gains - The gravity field of 
Earth, caused by the distribution of masses inside and 
on the surface of Earth, changes in time due to the 
redistribution of mass. Such mass fluxes can be due 
both to natural processes (such as the seasonal water 
cycle, ocean dynamics, or atmospheric variations), as 
well as due to human actions, such as the systematic 
withdrawal of groundwater for human consumption. 


The GRACE satellite pair has provided the first global 
measurements at horizontal resolutions from 300 km 
to global, and time scales from 10 days to interannual. 
These measurements have been used to assess the mass 
variability in the oceans, terrestrial water storage, and 
loss of ice mass in glaciers and ice sheets. Long-term 
trends, episodic variations such as result from large 
earthquakes, and seasonal changes have all been 
measured with unprecedented accuracy and detail. 

► Remote Sensing of Natural Disasters - Remote 
sensing involves the use of instruments to study phe¬ 
nomena from a distance. Natural disasters derive from 
hazards such as volcanoes, flooding, earthquake fires, 
and weather. Practically speaking, “remote sensing 
of natural disasters” principally refers to the use of 
airborne or spaceborne sensors to study natural disas¬ 
ters for detecting, modeling, predicting, analyzing, and 
mitigating effects on human populations and activities. 
Satellite instruments can provide imagery of large areas 
of Earth’s surface and overflights can often easily 
provide data for hard to access locations (either due to 
terrain, weather, or causes of the disaster itself). Air¬ 
borne instruments can also provide much of the same 
utility from remote sensing with the potential advan¬ 
tages of greater loiter or dwell time and rapid response. 

► Remote Sensing of Ocean Color - The oceans 
cover over 70% of Earth’s surface and the life inhabiting 
the oceans play an important role in shaping Earth’s 
climate. Phytoplankton, the microscopic organisms in 
the surface ocean, are responsible for half of the pho¬ 
tosynthesis on the planet. These organisms at the 
base of the food web take up light and carbon dioxide 
and fix carbon into biological structures releasing 
oxygen. Estimating the amount of microscopic phyto¬ 
plankton and their associated primary productivity 
over the vast expanses of the ocean is extremely chal¬ 
lenging from ships. However, as phytoplankton take 
up light for photosynthesis, they change the color of 
the surface ocean from blue to green. Such shifts in 
ocean color can be measured from sensors placed high 
above the sea on satellites or aircraft and are called 
“ocean color remote sensing.” In open ocean waters, 
the ocean color is predominantly driven by the phyto¬ 
plankton concentration and ocean color remote 
sensing has been used to estimate the amount of chlo¬ 
rophyll a, the primary light-absorbing pigment in all 
phytoplankton. 
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► Earth System Environmental Literacy - This 
chapter details the standards of ecological, environ¬ 
mental, and science literacy needed by the public to 
support decisions involved with assessing climate 
change data and predictions and taking the required 
actions. 

► Contemporary Sea Level Variations, Observa¬ 
tions and Causes - Sea-level change is a very sensitive 
index of climate change and variability. For example, 
as the ocean warms in response to global warming, 
seawaters expand, and thus sea level rises. When moun¬ 
tain glaciers melt in response to increasing air temper¬ 
ature, sea level rises because of freshwater mass input to 
the oceans. Similarly, ice mass loss from the ice sheets 
causes sea-level rise. Corresponding increases in fresh¬ 
water transport into the oceans changes water salinity, 
hence seawater density as well as ocean circulation that 
in turn affects sea level and its spatial variability. Since 
the early 1990s, satellite altimetry has become the main 
tool for precisely and continuously measuring sea level 
with nearly global coverage and a few days revisit time. 
Compared to tide gauges which provide sea level 
relative to the ground, satellite altimetry measures 
“absolute” sea-level variations. The main factors 
causing current global mean sea-level rise are thermal 
expansion of seawaters, land ice loss, and freshwater 
mass exchange between oceans and land water reser¬ 
voirs. These contributions vary in response to natural 
climate variability and to global climate change 
induced by anthropogenic greenhouse gas emissions. 

► Volcanoes, Observations and Impact - Volcanoes 
are critical geologic hazards that challenge our ability to 
make long-term forecasts of their eruptive behaviors. 
They also have direct and indirect impacts on human 
lives and society. As is the case with many geologic 
phenomena, the time scales over which volcanoes evolve 
greatly exceed that of a human lifetime. On the other 
hand, the time scale over which a volcano can move from 
inactivity to eruption can be rather short: months, 
weeks, days, and even hours. Thus, the scientific study 
and monitoring of volcanoes is essential to mitigate risk 
as well as measure the influx of gasses and particulates 
into the atmosphere. There are thousands of volcanoes 
on Earth, and it is impractical to study and implement 
ground-based monitoring at them all. Fortunately, there 
are other effective means for volcano monitoring, 
including increasing capabilities for satellite-based 


technologies. The focus of this chapter is mainly on 
geophysical observations of volcanoes and calderas as 
they pertain to eruption forecasting and prediction. 
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Glossary 

Damage The damage is defined as the disruption of 
the integrity/failure of a structure or function of a 
system. 

Earthquake fault It is the fault, along which relative 
ground displacement occur during an earthquake. 
Engineering structures Structures (i.e., dam, tunnel, 
slope cut, pylon, bridge, viaduct, etc.) with a given 
function or purpose are those designed and built 
according to the principles of engineering. 

Ground motions Ground motion is a change of a 
material point in space in a given time and is 
observed as displacement, velocity or acceleration 
and they are caused by the application of forces on a 
given material body (i.e., solid, fluid). 

Definition of the Subject and Its Importance 

Ground motion characteristics, deformation, and sur¬ 
face breaks of earthquakes depend upon the causative 
faults. Their effects on the seismic design of engineering 
structures are not considered in the present codes of 
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design, although there are attempts to include these 
factors in some countries (i.e., USA, Japan, Taiwan). 
This chapter describes ground motions and the effect 
of surface ruptures associated with earthquake faulting 
on response and stability of engineering structures. 

Introduction 

Earthquakes are known to be one of the natural disas¬ 
ters resulting in the huge losses of human lives and 
properties as experienced in the recent earthquakes. 
Since there is no way to prevent the occurrence of 
earthquakes in earthquake-prone countries such as 
Turkey, Japan, USA, and Taiwan, the design of struc¬ 
tures and residential and industrial developments must 
be done according to possible faulting types of earth¬ 
quakes (Fig. 1). It is well known that ground motion 
characteristics, deformation, and surface breaks of 
earthquakes depend upon the causative faults [1-13]. 
While many large earthquakes occur along the subduc- 
tion zones, which are far from the land, and their effects 
appear as severe shaking, the large inland earthquakes 
may occur just beneath or nearby urban and industrial 
zones as observed in the recent great earthquakes. 

The seismic design of engineering structures is gen¬ 
erally formed by considering the possible shaking char¬ 
acteristics of the ground during earthquakes in a given 
region. It is a fact that the residual (permanent) relative 
displacement of the ground is not considered in any 
seismic code all over the world, except for very long 
linear structures such as pipelines. This problem is cur¬ 
rently considered to be beyond the capability of seismic 
design concept for structures in the earthquake engi¬ 
neering, although it must be dealt with somewhat [1,2]. 


In this chapter, the author first describes ground 
motions and deformations in view of laboratory exper¬ 
iments on rock samples and recordings in earthquakes 
having different faulting mechanisms. Then the effects 
of surface ruptures and deformations due to earth¬ 
quake faulting on the response and stability of engi¬ 
neering structures through observations in recent great 
earthquakes are presented and discussed. 

Ground Motions 

It is observationally known that the ground motions 
induced by earthquakes could be much higher in the 
hanging-wall block or mobile side of the causative fault, 
as observed in the recent earthquakes such as the 1999 
Kocaeli earthquake (strike-slip faulting), the 1999 
Chi-chi earthquake (thrust faulting), the 2004 Chuetsu 
earthquake (blind thrust faulting), and the 2000 
Shizuoka earthquake and L’Aquila earthquake (normal 
faulting) [7, 10, 12-15]. Figure 2 illustrates the effect of 
hanging-wall effect on the maximum ground accelera¬ 
tions observed in 1999 Chi-chi earthquake (Taiwan), 
1999 Diizce earthquake (Turkey), and 2001 Geiyo earth¬ 
quake (Japan) with different faulting mechanisms [9]. 

The recent advances in measurement, monitoring, 
and logging technologies enable measurement and 
monitoring of the dynamic responses of geomaterials 
during fracturing and slippage. Studies concerning the 
dynamic responses of geomaterials during fracturing 
and slippage can now be more easily undertaken than 
in the past. Such studies have been recently undertaken 
at several sites [1, 6-9, 16-19]. The experiments have 
been performed on geomaterials ranging from very soft 
materials to hard rocks such as siliceous sandstone by 
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Attenuation of maximum ground accelerations for some earthquakes 


using different loading schemes and loading frames 
and specially designed stick-slip experimental device. 

Aydan and Ohta [3, 9] reported some tests on 
samples of various rocks such as Ryukyu limestone, 
tuff, granite, porphyrite, andesite, and sandstone. Two 
examples are Fuji-TV No.l and Mitake Sandstone 
MS2. Fuji-TV No.l granite is a prismatic sample 
(100 x 100 x 200 mm) (Fig. 3). The acceleration 
responses start to develop when the applied stress 
exceeds the peak strength, and it attains the largest 
value just before the residual state is achieved as seen 
in Fig. 3. Another important aspect is that the acceler¬ 
ation of the mobile platen is much larger than that of 
the stationary platen. This is a common feature in all 
experiments. In another words, the amplitude of accel¬ 
erations of the mobile part of the loading system is 
higher than that of the stationary part. 

It is well known that the ground motions are gen¬ 
erally smaller than those at ground surface. Nasu [20] 
carried out first instrumental studies on tunnels during 
the aftershock activity following 1924 Izu earthquake 
with a 2.4-m offset. Kanai and Tanaka [21] measured 
ground acceleration in underground caverns and at the 
ground surface. These measurements indicated that the 
surface acceleration was generally twice or greater than 
twice of that at depth as expected theoretically. 

Figure 4 shows the records of accelerations at the 
ground surface and at bedrock 260 m below at 


Ichinoseki strong-motion station (IWTH25) of KIK- 
NET [22] strong-motion network of Japan measured 
during the 2008 Iwate-Miyagi earthquake. The strong- 
motion station was located on the hanging-wall side of 
the fault, and it was very close to the surface rupture. As 
noted from the figure, the ground acceleration of the 
UD component was amplified by 5.67 times that at the 
bedrock. This record is also the highest strong motion 
recorded in the world so far. 

Figure 5 shows the acceleration records measured 
on the ground surface (GSA) and underground gallery 
(GSG) during the 2009 Mw 6.3 L’Aquila earthquake 
[14, 23]. The GSA station is at Assergi, and the GSG 
station is located in an underground gallery of Gran 
Sasso Underground Physics Laboratory of Italy. Both 
stations are founded on Eocene limestone with a shear 
wave velocity of 1 km/s. Although the epicentral dis¬ 
tances and ground conditions are almost the same, the 
acceleration at ground surface is amplified almost 
6.4 times that in the underground gallery. 

There are many empirical attenuation relations for 
estimating ground motions [24-28]. Including the 
possible next generation attenuation (NGA) relations, 
all these equations are essentially spherical or cylindri¬ 
cal attenuation relations and cannot take into account 
directivity effects. As it is shown in the beginning 
of this section, ground motions such as maximum 
ground acceleration (AMAX) and maximum ground 
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Acceleration and axial response of sandstone and granite samples 


velocity (VMAX) have strong directivity effects in rela¬ 
tion to fault orientation. Furthermore, these relations 
are generally far below the maximum ground accelera¬ 
tion, and they are incapable of obtaining the maximum 
ground acceleration (AMAX) or the preferred term 
“peak ground acceleration (PGA) ” 

Aydan and Ohta [4] proposed an attenuation rela¬ 
tion by combining their previous proposals [5, 25, 26] 
together with the consideration of the inclination and 
length of earthquake fault using the following func¬ 
tional form (Fig. 6): 

a max = Fi(V s ) * F 2 (R, 9 , <p,L*)* F 3 (M) (1) 

where V s , 0, (p, L*, and M are the shear velocity of 
ground, the angle of the location from the strike and 
dip of the fault (measured anticlockwise with the 


consideration of the mobile side of the fault), and 
earthquake magnitude. L* (in km) is a parameter 
related to the half of the fault length. 

The following specific forms of functions in Eq. 1 
were put forward as: 

F^Vs) = Ae~ v ‘/ B (2a) 

F 2 (R,9,(p,L*) = e - R ( 1 - Dsin6 + Esin2e )( 1 + Fcos( p)/ L * 

(2b) 

F 3 (M) = e M/G - 1 (2c) 

The same form can be also used for estimating the 
maximum ground velocity ( V mSiX ). The constants of the 
functions for maximum ground acceleration and 
velocity for intraplate earthquakes without consider¬ 
ation of faulting sense are given in Table 1. 
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Earthquake Faulting, Ground Motions and Deformations. Figure 4 

Acceleration records at ground surface and bedrock at Ichinoseki strong-motion station IWTH25 of KIK-NET in Iwate- 
Miyagi earthquake 


For interplate and slab earthquakes, only the value 
of constant G is different, and it has the value of 1.05, 
while other parameters remain the same. 

L*, which is approximately associated with the half 
of the fault length, is related to the moment magnitude 
in the following form: 

V = a+be cMw . (3) 

This parameter easily accounts for the effect of fault 
length. A similar form was initially proposed by Aydan 
et al. [18] for the effects of earthquake faults on under¬ 
ground structures. However, it should be noted that the 
attenuation characteristics of the earth’s crust vary 


depending upon the location and the sense of faulting. 
The values of coefficients of Eq. 3 are given in Table 2. 

Equation 1 takes into account the location of obser¬ 
vation points with respect to earthquake fault and 
ground conditions, which are fundamentally the main 
causes of scattering of observational data seen in spher¬ 
ical models for attenuation of ground motions includ¬ 
ing NGA relations [24-28]. 

Ground Deformations 

A fault is geologically defined as a discontinuity in 
geological medium along which a relative displacement 
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Earthquake Faulting, Ground Motions and Deformations. Figure 5 

Acceleration records at GSA and GSG strong-motion stations [14] 
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Earthquake Faulting, Ground Motions and Deformations. Figure 6 

Some examples of faulting: (a) normal faulting; (b) strike-slip faulting; (c) thrust faulting 
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Earthquake Faulting, Ground Motions and Deformations. Table 1 Values of constants in Eq. 2 for intraplate 
earthquakes 





B(m/s) 

D 

E 

F 

G(M) 

(Ms) 

(Mw) 

AMAX 

2.8 

1,000 

0.5 

1.5 

0.5 

1.11 

1.16 

VMAX 

0.4 

1,000 

0.5 

1.5 

0.5 

1.11 

1.16 


took place. Faults are broadly classified into three big 
groups, namely, normal faults, thrust faults, and strike- 
slip faults as seen in Figs. 1 and 6. A fault is geologically 
defined as active if a relative movement took place in 
a period of less than 2 million years previously. It is well 
known that a fault zone may involve various kinds of 
fractures, and it is a zone having a finite volume [29]. 
The appearance of ground breaks is closely related to 
geological structure, characteristics of sedimentary 
deposits, their geometry, the magnitude of earth¬ 
quakes, and fault movements. 

It is well known that the earth’s crust is ruptured 
and contains numerous faults and various kinds of 
discontinuities, and it is almost impossible to find 
a piece of land without faults. During the construc¬ 
tion of structures such as tunnels, dams, power 
plants, roadways, railways, power transmission 
lines, bridges, elevated expressways, etc., it is almost 
impossible not to cross a fault or faults. Therefore, 
one of the most important items is how to identify 
which fault segments observed on ground surface 
will move or rupture during an earthquake. A fault 
zone may involve various kinds of fractures as illus¬ 
trated in Fig. 7, and it is a zone having a finite volume 
[29]. In other words, a fault is not a single plane. 
Furthermore, faults may have a negative or positive 
flower structure as a result of their transtensional or 
transpressional nature and the reduction of vertical 
stress near the earth’s surface as shown in Fig. 8. 
For example, even a fault having a narrow thickness 
at depth may cause a quite broad rupture zone 
and numerous fractures on the ground surface during 
earthquakes. Furthermore, the movements of a 
fault zone may be diluted if a thick alluvial deposit 
is found on the top of the fault (i.e., 1992 Erzincan 
earthquake [30]). 

The recent global positioning system (GPS) also 
showed that permanent deformations of the ground 


Earthquake Faulting, Ground Motions and 
Deformations. Table 2 Values of constants in Eq. 3 


Faulting type 

a 

b 

c 

Normal faulting 

30 

0.002 

1.35 

Strike-slip faulting 

20 

0.002 

1.40 

Thrust faulting 

30 

0.003 

1.45 


surface occur after each earthquake (Fig. 2). The per¬ 
manent ground deformation may result from different 
causes such as faulting, slope failure, liquefaction, and 
plastic deformation induced by ground shaking [18]. 
These types of ground deformations will have limited 
effect on small structures as long as the surface breaks 
do not pass beneath those structures. However, such 
deformations may cause tremendous forces on long 
and/or large structures. The ground deformation may 
induce large tensile or compression forces, as well as 
bending stresses, in structures depending upon the 
character of permanent ground deformations. As an 
example, the ground deformations reported by 
Reilinger et al. [31] are shown in Fig. 9, which were 
caused by a strike-slip fault during the 1999 Kocaeli 
earthquake in Turkey. Blind faults and folding pro¬ 
cesses may also induce some peculiar ground deforma¬ 
tions and associated folding of soft overlaying 
sedimentary layers. Such deformations caused tremen¬ 
dous damage on tunnels during the 2004 Chuetsu 
earthquake, although no distinct rupturing took place. 

Effects of Surface Ruptures on Structures 

In this section, typical examples of damage to various 
structures induced by the fault breaks observed in 
recent large earthquakes since 1995 are presented 
[1, 2, 11, 32-50]. 
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Roadways and Railways 

The Trans-European Motorway (TEM) was damaged 
at three different locations by the earthquake fault 
caused by the 1999 Kocaeli earthquake. The motorway 
with east and west bounds having three lanes each was 
slightly elevated through embankments in the earth¬ 
quake-affected region. The surfacing of the motorway 
was damaged by rupturing and buckling (Fig. 10). The 





Earthquake Faulting, Ground Motions and 
Deformations. Figure 7 

Fractures in a shear zone or fault [29] 


railways were originally built on existing ground 
surface but were buckled near Tepetarla station 
where the earthquake fault crossed the railways at 
an angle of 50-55° with well-known £C S” shape 
(Fig. 11). 

Bridges and Viaducts 

Along the damaged section of the TEM motorway 
mentioned above, there were several overpass bridges. 
Among them, a four-span overpass bridge at Arifiye 
junction collapsed as a result of faulting (Fig. 12). 
The fault rupture passed between the northern 
abutment and the adjacent pier. The overpass was 
designed as a simply supported structure according 
to the modified AASHTO standards, and girders had 
elastomeric bearings. However, the girders were 
connected to each other through prestressed cables. 
The angle between the motorway and the strike of 
the earthquake fault was approximately 15°, while 
the angle between the axis of the overpass bridge and 
the strike of the fault was 65°. The measurements of 
the relative displacement in the vicinity of the fault 
range between 330 and 450 cm. Therefore, an average 
value of 390 cm could be assumed for the relative 
displacement between the pier and the abutment of 
the bridge. The Pefong bridge collapsed due to thrust 
faulting in the 1999 Chi-chi earthquake, which passed 
between the piers near its southern abutment as seen 
in Fig. 13. 



Earthquake Faulting, Ground Motions and Deformations. Figure 8 

Negative and positive flower structures due to transtension or transpression faulting (Modified from [2, 11, 29]) 
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Earthquake Faulting, Ground Motions and Deformations. Figure 9 

Permanent ground deformations and associated straining induced by the 1999 Kocaeli earthquake 
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Earthquake Faulting, Ground Motions and Deformations. Figure 10 

Buckling of roadway surfacing 



Earthquake Faulting, Ground Motions and Deformations. Figure 11 

Buckled railways 


Dams 

The Shihkang dam, which is a concrete gravity dam 
with a height of 25 m, was ruptured by thrust-type 
faulting during the 1999 Chi-chi earthquake (Fig. 14). 
The relative displacement between the uplifted part of 
the dam was more than 980 cm. Liyutan rock-fill dam 
with a height of 90 m and a crest width of 210 m, which 


was on the overhanging block of Chelungpu fault, was 
not damaged even when the acceleration records at 
this dam showed that the acceleration was amplified 
4.5 times of that at the base of the dam (105 Gal). The 
deformation zone of faulting during the 2008 
Wenchuan earthquake caused some damage at 
Zipingpu dam with concrete facing. 
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Earthquake Faulting, Ground Motions and Deformations. Figure 12 

The collapse of the overpass bridge at Arifiye 



Earthquake Faulting, Ground Motions and Deformations. Figure 13 

Collapse of Pefong bridge (note the uplifted ground on RHS) 


Tunnels 

The past experience on the performance of tunnels 
through active fault zone during earthquakes indicates 
that the damage is restricted into certain locations 
[18, 41-43]. Portals and the locations where the tunnel 


crosses the fault maybe damaged as had occurred in the 
2004 Chuetsu, 2005 Kashmir, and 2008 Wenchuan 
earthquakes (Fig. 15). A section nearby the Elmalik 
portal of Bolu tunnel collapsed (Fig. 16). This section 
of the tunnel was excavated under very heavy squeezing 
conditions [51]. Well-known examples of damage to 
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Earthquake Faulting, Ground Motions and Deformations. Figure 14 

Failure of Shihkang dam due to thrust faulting 



Earthquake Faulting, Ground Motions and Deformations. Figure 15 

Examples of damaged portals of tunnels 



Earthquake Faulting, Ground Motions and Deformations. Figure 16 

Collapse of Bolu tunnel during the 1999 Duzce earthquake 
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tunnels at locations, where the fault rupture crossed the 
tunnel, are mainly observed in Japan. The Tanna fault 
ruptured during the 1930 Kita-Izu earthquake and 
caused damage to a railway tunnel, with a relative 
displacement of about 100 cm [52, 53]. The 1978 
Izu-Oshima Kinkai earthquake induced damage to 
Inatori railway tunnel [54, 55]. A similar type of dam¬ 
age with a small amount of relative displacements due 
to motions of Rokko, Egeyama, and Koyo faults to the 
tunnels of Shinkansen and subway lines through Rokko 
mountains was also observed [33]. During the 1999 
Chi-chi earthquake, the portal of the water intake tun¬ 
nels was ruptured for a distance of 10 m as a result of 
thrust faulting. Except this section, the tunnel was 
undamaged for its entire length. 

Jiujiaya tunnel is a 2,282-m-long double-lane tunnel 
226.6 km away from the earthquake epicenter and about 
3-5 km away from the earthquake fault of Wenchuan 
earthquake. The tunnel face was 983 m from the south 
portal at the time of the earthquake. The concrete lining 
follows the tunnel face at a distance of approximately 
30 m. Thirty workers were working at the tunnel face, 


and one worker was killed by the flying pieces of rock 
bolts, shotcrete, and bearing plates caused by intense 
deformation of the tunnel face during the earthquake 
[17, 18]. The concrete lining was ruptured and fell 
down at several sections (Fig. 17). The effect of the 
unreinforced lining rupturing was quite large and intense 
in the vicinity of the tunnel face. The rupturing of the 
concrete lining generally occurred at the crown sections, 
although there was rupturing along the shoulders of the 
tunnel at several places. Furthermore, the invert was 
uplifted due to buckling at the middle sections. 

Subways 

The behavior of subways in active fault zones is basi¬ 
cally quite similar to that of tunnels. The Daikai station 
of the subway line in Kobe was caused by the lateral 
strike-slip movement of Egeyama fault just beneath this 
station, although some have tried to associate the col¬ 
lapse of the station with the intensity of shaking 
(Fig. 18). The investigation of the collapse of this sta¬ 
tion showed that collapse was not due to shaking, as the 
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Earthquake Faulting, Ground Motions and Deformations. Figure 17 

Earthquake damage at Jiujiaya tunnel due to permanent deformations [18] 
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Earthquake Faulting, Ground Motions and Deformations. Figure 18 

The collapse of Daikai station 



1999 Kocaeli earthquake 1995 Kobe Earthquake 1999 Chi-chi Earthquake 

Earthquake Faulting, Ground Motions and Deformations. Figure 19 

Damage to pylons due to faulting, (a) 1999 Kocaeli earthquake, (b) 1995 Kobe earthquake, (c) 1999 Chi-chi earthquake 


central columns of the station were subjected to tor¬ 
sional failure due to the permanent ground displace¬ 
ment, consistent with the lateral strike-slip movement 
of Egeyama fault [33]. 

Power Transmission Lines 

Power transmission lines generally consist of pylons 
and power transmission cables. The design of pylons 


and cables is generally based on the wind loads 
resulting from typhoons or hurricanes. The cables do 
not fail during earthquakes unless the pylons are top¬ 
pled due to faulting, shaking, or slope failure. During 
the 1999 Kocaeli earthquake, only one pylon was dam¬ 
aged, at a site near Ford-Otosan automobile factory at 
Kavakli district of Golciik town. At this site, a normal 
fault, secondary to the main lateral strike-slip faulting 
event, crossed through the foundations of the pylon 
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and with a vertical lift of about 240 cm. One of the 
foundations of the pylon was pulled out of the ground 
and exposed (Fig. 19a). Some of its truss elements were 
slightly buckled. A similar type of damage to pylons 
straddling the Nojima fault break in the 1995 Kobe 
earthquake (Fig. 19b) and the Chelungpu fault during 
the 1999 Chi-chi earthquake has been observed as 
shown in Fig. 19c. 


Line-Like and Tubular Structures 

Tubular structures may be specifically designated as 
petrol and gas pipelines, water pipes, and sewage sys¬ 
tems. They can be also classified as line-like structures. 


These structures may fail either by buckling or sepa¬ 
ration during a faulting event. Five such incidents 
were observed during the 1999 Kocaeli earthquake 
(Fig. 20). One of the incidents involved separation of 
a ductile iron pipe as a result of faulting near the 
collapsed overpass bridge. The second incident took 
place at the pumping facility of the Seka paper mill 
plant at Sapanca Lake. The third incident occurred 
near Tepetarla village where the railways buckled. 
The fourth and fifth incidents took place at Arifiye 
and nearby Ba§iskele. The fifth incident was quite 
important, since the fault caused a heavy damage to 
the main water pipe having a diameter of 2 m. Similar 
types of failures took place in the sewage pipe 
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Damage to pipes in the 1999 Kocaeli and 1999 Duzce earthquakes 
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networks whenever faulting breaks were observed. The 
natural gas pipelines crossing the Izmit gulf between 
Yalova and Pendik were undamaged. Some brittle 
asbestos water pipes were also damaged in Kayna§h 
and Fmdikli due the fault rupture of the 1999 Diizce 
earthquake. 

Buildings 

Many buildings along the earthquake faults of the 1999 
Kocaeli, Diizce, Chi-chi, and Wenchuan earthquakes 
were affected in a similar manner. During the site 
investigation of the 1999 Kocaeli earthquake, one 
could see totally collapsed, severely damaged, or intact 
buildings on or adjacent to the traces of the fault 
breaks. Examples are many; it is difficult to note all of 
them. Two typical examples are briefly discussed. Nev¬ 
ertheless, this topic deserves more thorough investiga¬ 
tion. The first example is a single-story reinforced 
concrete house with a raft foundation in Fmdikli vil¬ 
lage. The fault passed just underneath the building 
(Fig. 21a). The relative displacement of the fault 
break was about 200 cm with a 100-cm downthrow. 
The building was tilted, but no damage was observed. 
A very peculiar type of damage at an apartment com¬ 
plex consisting of 8 five-story apartment blocks was 
observed in Kullar village as shown in Fig. 21b. Seven 
apartment blocks failed in a pancake mode, while one 
apartment block remained standing. One of the failed 
apartment blocks just crossed the fault break which 
has a relative horizontal displacement of 240 cm 
and 20-25 cm vertical throw (north side down). 


The ground surface sloped to north. One of two apart¬ 
ment blocks on the southern side was damaged, while 
the other one collapsed toward the east in a pancake 
mode in accordance with the movement of its foun¬ 
dation. The five blocks on the northern side 
completely collapsed in a pancake mode toward west 
in accordance with the direction of shaking. Except for 
the apartment block over the fault break, the failure of 
the five apartment blocks on the northern side of the 
fault break may be considered to be purely due to 
shaking. The Bailu secondary school building of 
reinforced concrete survived the Wenchuan earth¬ 
quake, even though the surface rupture of the thrust 
fault passed underneath the building (Fig. 21c). 
Although the intensity of shaking on the southern 
side of the fault break should be the same, the damaged 
self-standing apartment block deserves further consid¬ 
eration. Whatever the reason, it is of great interest that 
even the most vulnerable buildings may also survive 
within a distance of 5-6 m from the fault break during 
inland earthquakes. 

Landslides and Rockfalls The recent large earth¬ 
quakes caused megascale slope failures and rockfalls, 
particularly along the surface ruptures on the hanging- 
wall side of faults. The slope failure induced in 
Beichuan town during the 2008 Wenchuan earthquake 
[40, 50] is of great interest. In association with the 
sliding motion of the earthquake fault, NW- or 
SE-facing slopes failed during this earthquake. There 
were two large-scale slope failures (landslides) in 
Beichuan town, which destroyed numerous buildings 
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The behavior of reinforced concrete buildings over surface ruptures, (a) Duzce earthquake, (b) Kocaeli earthquake, (c) 
Wenchuan earthquake 
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Earthquake Faulting, Ground Motions and Deformations. Figure 22 

Views of landslides in Beichuan 


and facilities. The NW-facing landslide (Jingjiashan) 
involved mainly limestone, while the SE-facing land¬ 
slide (Wangjiaya) involved phyllite (mudstone, 
according to some) rock unit (Fig. 22). Limestone 
layers dip toward the valley side with an inclination of 
about 30°. There are several faults dipping parallel to 
the failure surface within the rock mass. The angles of 
the lower and upper parts of the failed slope are 60° and 
30-35°, respectively. The existence of several faults 
dipping parallel to the slope with an inclination of 
about 60-65° creates a stepped failure surface. 

The SE-facing slope (Wangjiaya landslide) may 
involve slippage along the steeply dipping bedding 
plane (fault plane?) and shearing through the layered 
rock mass. In other words, it may be classified as 
a combined sliding and shearing effect [56] . The angles 
of the lower and upper parts of the failed slope are 
40-45° and 30-35°, respectively. The layers dip at an 
angle of 40° toward the valley, and the shearing plane is 
inclined at an angle of 20°. 


Future Directions 

The characteristics of ground motions and deforma¬ 
tions associated with earthquake faulting are presented 
in this chapter. Furthermore, the response and damage 
to structures due to surface ruptures are illustrated 
through actual observations in recent great 
earthquakes. 

The advanced strong-motion attenuation relations 
by Aydan and Ohta [4] seem to explain the wide range 
of observational data, which are considered to be 
scattering when spherical or cylindrical symmetric 
attenuation relations are used. A multisegmented or 
multisource model may be used for estimating strong 
motions [4] if surface ruptures due to earthquakes are 
relatively long. 

The GPS measurements of ground deformations 
during earthquakes (M > 6) clearly indicated that 
permanent ground deformations do occur. The overall 
deformation response involves both coseismic and 
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time-dependent components. The permanent ground 
deformation may result from different causes such as 
faulting, slope failure, liquefaction, and plastic defor¬ 
mation induced by ground shaking. They may cause 
tremendous forces on long and/or large underground 
structures such as tunnels, powerhouses, and under¬ 
ground storage facilities of oil, gas, and nuclear wastes. 
Therefore, the current seismic design codes must be 
revised to include these effects. 

As a conclusion, it is almost impossible for mankind 
to prevent the damage to structures if a fault break 
happens to be just passing underneath the structures. 
Nevertheless, the adverse effects of fault breaks on 
structures can be minimized to a negligible level in 
view of findings from the actual examples presented 
in this chapter. The following suggestions may be put 
forward for developing the seismic codes for structures 
in active fault zones: 

1. Roadways and railways should not be elevated, and 
they should be constructed on the existing ground 
surface in active fault zones. In doing so, recovery of 
roadways and railways damaged by fault breaks in 
the event of the earthquakes would be very rapid. 
As a result, effective rescue and rehabilitation works 
would be possible. 

2. Bridges and viaducts should be constructed as 
redundant structures, and simply supported struc¬ 
tures should be avoided. Although prestressing can 
be used to increase the confinement of girders along 
the longitudinal axis of the bridge, the structures 
should not be wholly prestressed to prevent domino 
action in the event of failure. Furthermore, T-type 
piers must not be used as they constitute top-heavy 
situations which are undesirable in case of faulting 
as well as shaking caused by earthquakes. 

3. Dams should never be built on active faults defined 
in engineering sense because of the risk of severe 
impacts to structures and residential areas down¬ 
stream in the event of failure. If for some reason 
dams must be constructed in faulted zones, their 
height should be kept to a minimum and the type of 
dams should be rock-fill. 

4. Observations and studies on various underground 
openings show that they are resistant to shaking. 
Nevertheless, the existence of discontinuities makes 
them vulnerable to collapse, particularly in case of 


shallow underground openings. This may have 
some important implications on areas where shal¬ 
low abandoned mines, underground shelters, and 
old tunnels exist. As damage to tunnel linings due 
to fault breaks would be localized, there is no need 
for extra precautions at those locations. Under¬ 
ground openings crossing faults and fracture 
zones may be enlarged to accommodate relative 
slips along faults and fracture zones. The lining of 
the openings should be ductile to accommodate 
permanent ground deformations at such zones. 
Furthermore, the brittle linings of the existing 
underground structures should be lined with duc¬ 
tile thin plates or fiber-reinforced polymers 
together with rock bolts at fracture and fault 
zones, where permanent ground deformations 
may occur. 

5. Tubular structures should be designed with the use 
of flexible joints to accommodate relative displace¬ 
ments with the consideration of likely relative dis¬ 
placement of the ground. 

6. Buildings should be designed as rigid box-like struc¬ 
tures with the use of shear walls. Walls with fragile 
hollow bricks having no structural strength must not 
be allowed during building construction. If the brick 
walls are to be used, the walls should be constructed 
before the construction of columns and beams in 
order to attain the necessary structural integrity, 
and they should be light, resistant, and ductile. 

7. Vertical accelerations of inland earthquakes can be 
0.65-0.75 times the maximum horizontal accelera¬ 
tions; sometimes they may even exceed maximum 
horizontal acceleration. The effect of vertical accel¬ 
erations must be considered in structural design. 

8. Structures of great importance should be sited on the 
rock foundations rather than on loose alluvial 
ground, since amplification of movement in loose 
ground maybe several times that of rock foundation. 
The failure risk of structures during earthquakes 
could be drastically reduced by such a selection. 
Nuclear power plants and even arch dams built on 
rock foundation performed very well without any 
damage during the recent great earthquakes. 

9. Global, regional, and local geological and geotech¬ 
nical maps of cities and towns for residential and 
industrial developments in seismically active fault 
zones should be prepared. Furthermore, detailed 
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geological and geotechnical investigations should 
be required before any new development to reduce 
the seismic risks to structures during earthquakes in 
active fault zones. 
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The goal of much of modern toxicology and pharma¬ 
cology has since been to define the precise biological 
factors that dictate the relationship between dose and 
response following exposure to chemicals. One of the 
major challenges in risk assessment more recently 
appreciated is the difficulty in predicting with statisti¬ 
cal certainty adverse effects at low doses when the 
probability of response in a population is low [ 1 ]. The 
nature of the risks at low doses may be qualitatively 
different from the nature of the risks at high doses. In 
other words, the risk at low doses is not just a smaller 
version of the risk at high doses, because the mecha¬ 
nism^) responsible for the risk at low doses is often 
different. 
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Glossary 

Acclimation A decline in response with previous 
exposure. 

Adaptation Physiological changes that increase the 
fitness of a population in response to chronic 
exposure. 

Epigenetic Mechanisms of altered gene expression 
that do not change primary DNA sequence. 

Hormesis A beneficial effect from a low dose of 
a “poison” or contaminant. 

NOAEL No-observed-adverse-effect level is the 
highest treatment level from which the response 
does not differ statistically from control groups. 

Threshold dose Dose required to produce 
a measurable response. 

Definition of Subject 

The concept of “dose makes the poison” has been 

attributed to Paracelsus who lived in the 1500s. 


Paracelsus (1493-1541), or Theophrastus Phillippus 
Aureolus Bombastus von Hohenheim, is often credited 
as being the father of modern toxicology for develop¬ 
ment of the concept of a dose-response relationship. 
Paracelsus is quoted as saying “all things are poison and 
nothing is without poison; only the dose makes a thing 
not a poison” [2]. This principle tells us that even the 
most innocuous of substances, water for example, can 
be dangerous in excess. The public is generally aware 
that dehydration, or an excess of salt relative to water, 
can be life threatening. However, hyponatremia, an 
excess of water in the blood, is less widely accepted as 
a potentially fatal condition. As recently as 2007, 
a contestant in a radio contest died after consuming 
6 1 of water in 3 h [3]. The idea that the amount of the 
exposure or dose determines the frequency or magni¬ 
tude of response is the fundamental basis for most 
toxicological research. In theory and practice, the prin¬ 
ciple that the dose determines the poison is the foun¬ 
dation for experimental toxicology even to this day. 

Historically, the relationship between dose and 
response was considered to be in one direction (i.e., 
an increase in response with dose or a decrease in 
response with dose). This single direction, or mono¬ 
tonic, relationship is based on both an understanding 
of biological processes and mathematical descriptions 
of experimental outcomes [4]. The most common 
description of the relationship between dose and 
response is a sigmoidal curve (Fig. 1). This relationship 
describes the response expected from the log of the 
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Ecological and Health Risks at Low Doses. Figure 1 

The classical sigmoidal dose-response curve has doses or 
concentrations that elicit a 50% response (EC50). 
Theoretically, there is a threshold concentration or dose 
below which the response is the same as controls. This 
spontaneous response is the background rate in the 
absence of treatment 

dose or concentration. Under this relationship, the 
lowest doses or exposures elicit no measureable 
response. In general, and in agreement with Paracelsus, 
organisms have some tolerance for exposure to poisons 
and toxicants. Some “threshold dose” is required to 
produce a measureable response. The response then 
increases exponentially until the midpoint of the 
curve, where it is half of the maximal response, 
a point referred to as the EC50 (effective concentration 
to produce 50% response). The rate of increase in the 
response decreases above the EC50 to zero where the 
response reaches its maximum. Increased doses or con¬ 
centrations above the maximum (EC 100) will not 
increase the response. The sigmoidal curve is the 
default assumption for the relationship between dose 
and response. 

Despite this general convention, agreement on the 
shape of the dose-response curve at the low end has 
been controversial [4-6]. The classic sigmoidal dose- 
response relationship implies that there is a threshold 
for a biological response. Below that threshold value, 
exposure to a toxicant should have no measureable 
effect. Some responses to toxicants, such as the occur¬ 
rence of tumors, occur spontaneously in control 
groups. In the absence of exposure, this rate of occur¬ 
rence is considered the “background” level (Fig. 1). If 
an incremental dose increases this background rate of 


response, then the relationship between the incremen¬ 
tal dose and response can be fitted linearly [4]. Fur¬ 
thermore, early studies of the biological basis of tumor 
formation led to the idea that a single molecule of 
a carcinogen could cause a mutation that led to 
tumor development. These “one hit” models have 
been disputed with both theoretical and physical lines 
of evidence [4]. For low-energy radiation, theoretical 
studies indicate that at least two hits of radiation are 
necessary for mutagenesis, potentially due to high 
DNA repair rates. Furthermore, cancer research indi¬ 
cates that tumor initiation requires a series of promo¬ 
tional changes even after mutations are incorporated 
into the genome. 

For carcinogens, the precautionary assumption has 
been that every dose can elicit a response. In other 
words, there is not an observable no-effect threshold. 
The most common approach for estimating the low- 
dose effects of a carcinogen is to adequately describe the 
relationship between exposure and response over some 
range of treatments and then extrapolate to lower 
doses, generally using a linear function. If all poisons 
have a threshold dose, the linear extrapolation of low- 
dose response is a conservative assumption. There are 
reports of DNA adduct formation in proportion to dose 
even at extremely low concentrations. The US Environ¬ 
mental Protection Agency (EPA) has adopted a policy to 
utilize the formation of DNA and protein adducts 
as a means to extend the understanding of the shape of 
the dose-response relationship beyond the range that 
can be investigated with cancer bioassays [7]. The 
relationship between formation of DNA adducts and 
physiologically meaningful formation of tumors is not 
as clear. A review of the literature on biomarkers indi¬ 
cates that the dose-response relationship for bio- 
markers of exposure is not the same as the 
relationship for biomarkers of effect [7]. Recent 
descriptions of this debate include the historical pro¬ 
gression of acceptance of linear extrapolation by the 
EPA [8]. 

In contrast to carcinogens, the assumption of a no- 
effect threshold has been used extensively for chemicals 
that are not considered carcinogenic. For these 
chemicals, the dose at which no adverse effects are 
detectable is a critical value incorporated into risk 
assessments. This dose is called the no-observed- 
adverse-effect level or NOAEL. The NOAEL is derived 
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from experimental data; it is the highest treatment level 
which does not differ statistically from the control 
groups. NOAEL is the basis for regulatory guidance 
such as acceptable intake levels. Data limitations and 
experimental design can limit the utility of the NOAEL. 
For example, note the chosen test concentrations 
represented by the gray vertical boxes in Fig. 2 for the 
NOAEL and the lowest observed adverse effects level 
(LOAEL). The doses in this hypothetical experiment 
might have been chosen to produce the maximum 
spread of responses or cover the relevant dose range 
for this compound. However, a higher concentration 
may have been chosen which would have still resulted 
in no significant response - and therefore a different 
value for the NOAEL. An alternative approach is to use 
a benchmark dose (BMD) [9]. The BMD is a derived 
value based on the effective range of the dose-response 
curve. Calculated BMD have the benefit of using 
a maximum likelihood approach to determine 
a confidence interval for the value. The variance in 



Log [Concentration] 

Ecological and Health Risks at Low Doses. Figure 2 

The controversial low end of the dose-response curve is 
shown here. Experimental data determine the "lowest 
observed adverse effects level" (LOAEL) and the "no- 
observed-adverse-effects level" (NOAEL), indicated by the 
vertical gray boxes. Linear extrapolation from the LOAEL 
0 dashed line) is a conservative estimation of the low-dose 
effects. Threshold dose can be estimated statistically using 
a lower bound benchmark dose (BMDL) through 
a proportion of the response (gray bars). In this 
hypothetical case, all extrapolations would overestimate 
toxicity if there were a hormetic response [gray arrow) 


the response to the LOAEL or the EC50 can be used 
to estimate the concentration resulting in 5% response 
(or another level). 

Extensive evidence now indicates that often the 
relationship between dose and response is biphasic. In 
other words, low-dose treatments may cause a decrease 
in response while higher doses cause an increase in 
response. An extensive review of the toxicological liter¬ 
ature [10] found thousands of cases of these types of 
nonmonotonic dose-response relationships. In many 
of these cases, low-dose treatments with a toxicant 
induced a beneficial response - fewer tumors than 
unexposed controls, for example. A beneficial effect 
from a low dose of a “poison” or contaminant is 
referred to as hormesis. These beneficial effects have 
been reported over a range of responses from growth 
rate to carcinogenesis. Modeling studies indicate that 
hormetic response can be driven by a summation of the 
dose-response relationships for different steps in the 
biological response, specifically receptor binding, tran¬ 
scriptional activation, rate of DNA repair, and rate of 
cell division [5]. An example of how multiple physio¬ 
logical responses work together to result in a hormetic 
response is described below. 

Alternatively, low-dose exposure may stimulate an 
increase in response while higher dose produces a lesser 
response. This “inverted U” shaped dose-response 
curve has also been observed in many cases. This rela¬ 
tionship between dose and response is characterized by 
a heightened response to low doses and a depressed 
response to higher doses. A hypothetical example 
might be an increase in reproduction observed in 
response to a low concentration of estrogen and 
suppressed reproductive rates observed with higher 
treatments. The more complicated relationships 
between dose and response demonstrated in cases of 
hormesis and “inverted U” curves are likely to be 
observed in systems that require multiple steps to pro¬ 
duce the response [5]. Systems biology and genomic/ 
proteomic studies provide an opportunity to test sum¬ 
mation hypotheses in a rigorous manner [1, 11]. 
Modeling and circuitry analysis provide an opportu¬ 
nity to test, or to generate new predictions about the 
responses of cells, individuals, and populations. 

Further complicating the relationship between dose 
and response is the issue of exposure to mixtures [12]. 
Some cocontaminants will have the same mechanisms 
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of action and interact with the same receptors, while 
others will interact antagonistically through similar 
mechanisms of uptake, circulation, transformation, or 
elimination. EPA guidance recommends dose addition 
for chemicals which have similar mechanisms of action 
or exhibit response addition. However, predicting the 
toxicity of mixtures is difficult even when the mecha¬ 
nisms of action are understood. 

The focus or target of a research study has 
a significant impact on predictions related to the 
dose-response relationship. Many cellular responses 
are either all or none switches (i.e., cell division). 
These cellular responses are described as being “quan- 
tal,” and the sensitivity of the response in these cases 
will be described as a cumulative frequency of response 
at each dose. How the frequency of these incidences 
relates to specific biological response and adverse 
health outcomes is largely an unresolved issue [11]. 
By contrast, examination of a population of cells will 
produce a smooth dose-response curve as individual 
cells respond at different concentrations. An analogous 
dose-response relationship can be described in epide¬ 
miological studies [6]. Individuals participating in 
a study will either have a tumor or not; their response 
to their particular dose is quantal. However, different 
individuals will have different sensitivities based on 
their age, health, lifestyle, and previous exposure his¬ 
tory [4]. Each individual is likely to have a different 
“threshold” for tumor incidence based on their situa¬ 
tion. Summation of these different thresholds is 
expected to “smooth” the dose-response relationship 
in studies that include a diverse population. Indeed, 
most human epidemiological studies have a linear rela¬ 
tionship between dose and response [6]. 

Physiological Response to Low Doses 

As described above, the response of an organism to 
a toxicant may be nonmonotonic. One range of treat¬ 
ment may suppress a response while another may 
enhance the response [12]. Physiological mechanisms 
underlying these complicated hormetic responses are 
of much interest. The long history of these types of 
responses has been documented by Rozman and Doull 
[13]. Their review summarizes the notion that 
a beneficial response to a low dose of a poison is the 
result of homeostatic compensation in response to 


a stressor. Rozman and Doull make an analogy between 
low-dose treatment and physical exercise. Both are 
minor stressors and homeostatic response to those 
stimuli may ultimately enhance fitness. Adaptation to 
a low dose of a stressor, like recovery from physical 
exercise, may be the result of diverse physiological 
mechanisms. As a general phenomenon, adaptation 
or compensation induced by a low-dose treatment is 
believed to be the basis for hormesis [14]. 

In situations that result in a hormetic response, 
continuous exposure to a relatively constant dose 
should result in a continuous state of (over)compensa¬ 
tion. This response has been described as steady-state 
hormesis induced by chronic exposure [15]. A critical 
cellular aspect involved in the maintenance of homeo¬ 
stasis is the suite of biotransformation and conjugation 
enzymes [ 16] . Three categories of enzymes, Phase I—III, 
are the basis for xenobiotic metabolism. Phase I 
enzymes include the cytochrome P450 family, which 
classically oxidize xenobiotics rendering them more 
water soluble. However, these Phase I enzymes also 
activate some procarcinogens and mutagens to reactive 
metabolites. Phase II enzymes conjugate solubilizing 
groups that further facilitate excretion. Phase III 
enzymes are membrane-bound transporters that 
export metabolized xenobiotics from the cell. 
A model of these xenobiotic metabolizing enzymes 
(XMEs) by Zhang and coworkers [17] has generated 
a mechanistic hypothesis for the hormetic response to 
many mutagens and carcinogens. 

Zhang et al. [17] have described a multipart negative 
feedback and feedforward model of XMEs, including 
cross-induction of Phase I and II enzymes. Simulations 
of this model find a drop in the production of a reactive 
metabolite below baseline levels in response to 64 h of 
exposure to an exogenous procarcinogen. Activation of 
a homeostatic control mechanism, in this case tran¬ 
scriptional induction of XME in response to binding of 
a nuclear receptor (the aryl hydrocarbon receptor), will 
compensate for the poison and produce a new physio¬ 
logical state. Because of this hormetic compensation, 
the authors conclude that the response of biological 
systems will not be “linearly correlated with the dose of 
the stressing agent, especially in the low dose region” 
[17]. The regulation and induction of Phase I and II 
enzymes is a common cellular mechanism for coping 
with toxicants. In the cases where Phase I metabolism 
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activates a toxicant, the feedforward induction of these 
enzymes is a potential mechanism for hormetic 
responses at the cellular level. This model provides 
a hypothesis for the cellular basis of suppression of 
background levels of tumor induction in response to 
low-dose exposure to carcinogens to the extent that 
reactive metabolites are correlated with mutagenesis, 
apoptosis, and cell proliferation. 

Low-level or subacute doses have the potential to 
produce long-term changes in the magnitude of the 
response elicited from a toxicant. A decline in response 
with exposure is called acclimation. The process of 
acclimation is best studied in relation to exposure of 
fish to heavy metals (reviewed by Hamilton and Mehrle 
[18]). Pretreatment of animals with metals, specifically 
cadmium, copper, mercury, and zinc, is protective for 
subsequent exposures. Exposure induces synthesis of 
thionein, a protein that normally occurs only in trace 
amounts in certain tissues (blood, gills, liver, kidney, 
and intestine). Binding of metals to this protein, then 
called metallothionein, reflects the relative concentra¬ 
tions of the metals present and their binding affinity. 
Chronic toxicity of metals that are sequestered by 
metallothionein has been described as “spillover”; 
once metallothioneins metal binding sites are satu¬ 
rated, the excess will cause tissue damage. This example 
of competition for a target will influence the shape 
of the dose-response curve at the low end. 
Metallothionein is induced by metal, and to a lesser 
extent, by glucocorticoids. Therefore, some protection 
or acclimation to metals may be produced by a number 
of different stressors that produce a glucocorticoid 
response. In this case, exposure to some classes of 
metals or heightened stress response will alter the 
threshold dose for tissue damage from toxicants such 
as mercury, cadmium, and silver. 

Different populations will vary in their response to 
a toxicant, especially at the low range of the dose- 
response curve. If a mathematical description of the 
response indicates that 5% of the population should be 
impacted, a change in sensitivity of the population (not 
to mention sample size and health-related factors) will 
result in more variability in the reported response rate. 
Populations may differ in age, or individuals may 
have encountered a previous exposure as described 
above. Additionally, populations may have a history 
of chronic exposure that results in transgenerational 


changes in the dose-response curve. Genetic or epige¬ 
netic adaptation to a chronic stressor will impact the 
response of a population to exposure. Resistant phe¬ 
notypes may have enhanced fitness under chronic 
exposure conditions. Those phenotypes would be 
expected to increase in frequency in subsequent 
generations. Experimental studies of killifish from 
a creosote-contaminated Elizabeth River (Virginia) 
location have documented resistance to the teratogenic 
effects of creosote exposure which is maintained 
through two generations of animals reared in clean 
conditions [19]. Transgenerational resistance to the 
polycyclic aromatic hydrocarbons (PAHs), which 
make up the majority of creosote, is not associated 
with a general increase in resilience. In fact, offspring 
of Elizabeth River-collected killifish exhibited higher 
mortality in response to hypoxic conditions. This study 
reveals that the magnitude of resistance to PAHs 
decreases with subsequent laboratory-raised genera¬ 
tions (F2 vs. FI); therefore the mechanism of adapta¬ 
tion might not be conferred genetic resistance, but 
instead be a heritable change in gene regulation. 

Epigenetic mechanisms are molecular phenomena, 
such as DNA methylation of CpG nucleotides, which 
regulate gene expression without changing the DNA 
sequence itself [20]. Methylation patterns are normally 
established two times during development in somatic 
cells during gastrulation and in the germ cells after sex 
determination [21]. Many common diseases (e.g., dia¬ 
betes, osteoporosis, atherosclerosis etc.) have a 
sex-related incidence bias that may be a result of sex- 
specific (e.g., X-chromosome inactivation) epigenetic 
control [22]. As reviewed by Gabory and coworkers, 
environmental factors including xenobiotics, nutrition, 
or social factors can influence epigenetic marks 
throughout development [22]. Furthermore, when 
germ cells are adversely epigenically reprogrammed, 
sex-specific transgenerational effects have been 
reported. For example, when pregnant rats were 
exposed to the antiandrogen, vinclozolin, all subse¬ 
quent generation (F1-F4) males had sperm cell defects 
and subfertility [21, 23]. Epigenetic changes are also 
becoming well-recognized mechanisms contributing to 
cancer and have been associated with all stages of 
tumor formation and progression [24]. Losses of 
DNA methylation in oncogenes, hypermethylation 
of tumor suppressor genes, and altered histone 
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modifications have all been noted during oncogenesis. 
The extent to which these epigenetic modifications of 
gene expression and/or their multigenerational conse¬ 
quences mediate xenobiotic low-dose effects are still 
largely unknown. 

Specific Examples 

While a list of xenobiotics that display different effects 
at high and low doses is likely to grow as “omic” tools 
and systems biology approaches are implemented in 
risk assessment, there are a number of examples already 
known. The bulk of the initial work on predicting low- 
dose effects was done with respect to carcinogen risk 
assessment [8]. Radiation and polycyclic aromatic 
hydrocarbons (PAHs) will be used to highlight issues 
in cancer low-dose predictions. Similar to cancer pro¬ 
gression, neurotoxicological risk assessment has 
acknowledged that neurological effects are rarely 
unifactoral [25, 26]. Here, lead and organophosphate 
pesticides will be used as case studies. A focus of 
a National Toxicology Program Peer Review [27], 
certain endocrine disrupting chemicals including 
bisphenol A (BPA) also have been recognized for their 
potential to cause “biological change” at low doses. 
Accordingly, the specific examples described below 
will focus on carcinogens, neurotoxicants, and endo¬ 
crine disruptors. 

A recent review by Calabrese has provided the his¬ 
tory of carcinogen risk assessment [8]. As he describes, 
the basis for the idea that carcinogen effects were linear 
at low doses was derived from the early work of 
Hermann Muller [28], wherein it was discovered that 
X-ray radiation induced mutations in Drosophila. The 
relationship between low-dose linearity and radiation- 
induced mutations in germ cells was further asserted in 
the 1956 Biological Effects of Atomic Radiation 
(BEAR) Report. The idea that mutations in somatic 
cells could lead to cancer also with a linear response at 
low doses was brought to public debate following pub¬ 
lication of E.B Lewis’ paper relating radiation risks and 
leukemia [29]. The criticisms of this manuscript are 
reviewed by Calabrese [8]; nonetheless, it was a key 
contribution to the adoption of a precautionary 
approach and low-dose linear risk assessment for 
both radiation and ultimately chemical carcinogens. 
The current risk assessment paradigm for radiation 


exposure includes a “single-cell origin, a radiation- 
induced mutation/aberration as the rate-limiting step, 
simple expectations of a linear non-threshold radiation 
response (LNT) at low doses and little or no other 
relevant contributions.. .to promotion or progression 
of the tumor” [30]. As pointed out by Goodhead, 
however, there are a number of radiobiological phe¬ 
nomena including bystander effects and genome insta¬ 
bility that are not yet completely understood that may 
impact how risk of low doses should be assessed and 
ultimately how human radiation protection is 
performed. Bystander effects, effects in neighboring 
nonirradiated cells, are most apparent in the low-dose 
range where overt toxicity by direct radiation is not 
dominant [31]. The consequences of bystander effects 
would be supposed to be relevant to carcinogenicity in 
that they could be detrimental (toxic effects or muta¬ 
tions in adjacent cells) or beneficial (apoptosis of 
impacted cells, so no additional tumor progression 
possible); however, how to mechanistically account 
for their role in dose-response relationships and risk 
assessment is currently unknown [30]. Similarly, de 
novo genetic instability in cells generations after irradi¬ 
ation appears to display nonlinear dose responses, and 
the mechanisms are also unclear. Further research will 
certainly be necessary to establish the relevance of these 
more recently recognized potentially low-dose effects 
of radiation and whether a change in the radiation risk 
assessment paradigm is necessary [30]. 

With respect to chemical carcinogens, the Guide¬ 
lines for Carcinogen Risk Assessment [32] establish 
a two-step process for dose-response assessment. 
First, animal or human effect data are used to model 
biological effects such as adduct, mutation, or tumor 
formation. The second step then is to extrapolate below 
the range of observation to the expected range of 
human exposure with considerations as appropriate 
or available to mode of action of the chemical, quality 
of data, and uncertainties therein. Because of the lack of 
human data, the high costs and low sensitivity of 
rodent cancer studies, and the goal to protect against 
one cancer per million, extrapolation of dose back five 
orders of magnitude is sometimes necessary [33]. Fur¬ 
thermore, it has recently become apparent that bio- 
markers of exposure (e.g., DNA adducts; linear 
response) do not have the same dose-response curve 
shape as biomarkers of effect (e.g., mutations; 
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threshold response) [7]. As reviewed by Swenberg 
et al., acrylamide, methymethanesulphonate, and eth¬ 
ylene oxide all represent examples wherein different 
exposure and effect curves have been reported [7]. 
Similarly, in a recent 40,800 animal study investigating 
ultralow-dose effects of the PAH dibenzo[a,1] pyrene in 
rainbow trout, the initial carcinogen-DNA adduct and 
the hepatic tumor response curves were “distinctly 
different” leading the authors to conclude that DNA 
adducts did not “provide an accurate biomarker for 
ultralow dose cancer risk evaluation” [33]. Addition¬ 
ally, the study design necessitated significantly less 
extrapolation below the modeled data set (200-fold 
vs. typical 100,000-fold) and found that the virtual 
safe dose for liver tumor formation was 500-1,500 
times higher than the standard linear extrapolation 
from the lower confidence limit on the dose causing 
10% incidence typically used in carcinogen risk assess¬ 
ment [33]. Mechanistically, the difference in these 
curves is relatively easy to appreciate. A specific PAH- 
DNA adduct will reflect exposure to that parent PAH 
exclusively. Furthermore, there exist very sensitive 
analytical methods of DNA adduct measurement. 
However, it is known that many adducts are not 
promutagenic. When mutagenic adducts formed, the 
excretion and repair rates of a particular adduct may be 
variable because of multiple gene interactions; yet, 
a relationship between adduct formation and initial 
exposure will exist. In contrast, mutations are caused 
by a number of initiating events. There always exists 
some background of mutations caused by endogenous 
chemicals or mistakes in DNA replication. Therefore, 
while physiologically more significant, mutation mea¬ 
surement in exposure experiments will ultimately 
approach a background frequency at low doses [7]. As 
these recent reports indicate, the way to measure and 
predict risks due to exposure to carcinogens is still 
relevant and continually being reevaluated even 
though this was one of the first areas in toxicology 
research to appreciate the need to estimate effects at 
low doses [8]. 

Effects of low doses are also important for acute 
acting compounds. Humans have used and been 
exposed to lead for thousands of years, and at high 
doses, a range of adverse effects can occur [34]. In 
humans, effects depend on developmental stage with 
the central nervous system as a target in children 


and peripheral neuropathies, nephrotoxicity, and car¬ 
diovascular effects common in adults [35]. Recent 
emphasis has been on how childhood low-dose, envi¬ 
ronmentally relevant lead exposure affects cognitive 
function. As more data were collected on lead’s low- 
dose toxicity, the regulatory limit of concern for child¬ 
hood lead intoxication reduced from its initial value of 
<60 pg/dl set in the 1960s, to 40 pg/dl in 1971, to 30 pg/ 
dl in 1978, and 25 pg/dl in 1985 [36]. The current limit 
recommended by the Center for Disease Control and 
World Health Organization is <10 pg/dl. Studies from 
around the world have found blood lead concentra¬ 
tions far exceeding this level in particular study 
populations (reviewed in [34, 37]). For example, chil¬ 
dren from a lead-contaminated area of Ecuador had 
blood lead concentrations of 52.6 pg/dl (range 9.9- 
110.0 pg/dl) [38]. Findings from the National Health 
and Nutrition Examination Surveys (NHANES) in the 
United States show that average children’s blood lead 
levels have been declining since the 1970s, and the 
prevalence of elevated levels in children was 0.7% in 
the 1999-2002 survey [39]. However, recent evidence 
suggests that blood lead concentrations below 10 pg/dl 
are not without adverse effect. 

The 10 pg/dl standard was based, in part, on a study 
wherein children were grouped based on umbilical- 
cord blood concentrations (either <3 pg/dl, between 
6 and 7 pg/dl, or >10 pg/dl). At 2 years of age, Mental 
Development Index scores were significantly lower in 
children from the >10 pg/dl group [40]. However, 
using the NHANES III cohort, Lanphear and 
coworkers found that, after results were corrected for 
confounding factors, there was an inverse relationship 
between blood lead concentration and cognitive func¬ 
tioning. Specifically, they reported “for every 1 pg/dl 
increase in blood lead concentration, there was a 
0.7-point decrement in mean arithmetic scores, an 
approximately 1-point decrement in mean reading 
scores, a 0.1-point decrement in mean scores on 
a measure of nonverbal reasoning, and a 0.5-point 
decrement in mean scores on a measure of short-term 
memory” [41]. Importantly, these results included 
children who had blood lead concentrations less than 
5 pg/dl. A more recent study by this laboratory 
followed 172 children measuring their blood lead con¬ 
centrations at 6,12,18,24, 36,48, and 60 months of age 
and tested their performance on the Stanford-Binet 
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Intelligence Scale at 3 and 5 years of age [42]. They 
found that as lifetime average blood lead concentrations 
increased from 1 to 10 pg/dl, IQ declined by 7.4 points 
using a nonlinear model or 4.6 points with a linear 
model. Additionally, chelation therapy is largely inef¬ 
fectual for children with blood lead concentrations 
between 20 and 44 pg/dl [43] and cognitive deficits 
due to lead exposure can last into adulthood [44]. 
These studies suggest that there is not a clear threshold 
for lead effects in children. 

Unfortunately, the molecular targets and mechanis¬ 
tic reasons to explain low-dose effects of lead exposure 
are currently lacking. Clearly, critical neurodeve- 
lopmental processes occur in the early years of life 
that have profound effects on learning and memory. 
A current area of emphasis with respect to mechanisms 
of lead toxicity is focused on N-methyl-D-aspartate 
receptor (NMDAR)-mediated synaptic plasticity, spe¬ 
cifically hippocampal long-term potentiation [37]. As 
reviewed by Toscano and Guilarte [37], in rodent 
models, environmentally relevant lead exposure alters 
NMDAR subunit expression and function including 
decreased calcium and nitric oxide signaling, both crit¬ 
ical signaling pathways involved in learning and mem¬ 
ory. Better understanding of the molecular targets of 
lead toxicity may in turn offer more effective therapeu¬ 
tic approaches to lead exposure in the future. 

Organophosphate pesticides (OP), such as chlor- 
pyrifos, have a well-recognized biomarker of exposure 
and effect in the irreversible inhibition of acetylcholin¬ 
esterase (AChE). Significant inhibition of AChE causes 
a recognizable “cholinergic syndrome” causing accu¬ 
mulation of acetylcholine at cholinergic synapses and 
overstimulation of cholinergic receptors of the musca¬ 
rinic and nicotinic type [45]. In addition to the acute 
cholinergic crisis, high-dose OP exposure can lead to 
OP-induced delayed polyneuropathy. Percent decreases 
in plasma pseudocholinesterase or red blood cell cho¬ 
linesterase (e.g., 30% from baseline) are used in work¬ 
place surveillance even though the physiological 
significance of circulating cholinesterase activity is 
not known. Furthermore, signs of intoxication typi¬ 
cally do not occur until cholinesterase inhibition is 
>70% [25]. Accordingly, organophosphate pesticides 
are another class of neurotoxicant that can exhibit 
strikingly different low- versus high-dose toxic effects. 


Chlorpyrifos is one of the most widely applied OPs. 
In the United States, registrations for its use in residen¬ 
tial settings were canceled in June 2000 [46]. Prior to 
this ban, chlorpyrifos was detected in 99.7% of per¬ 
sonal air samples and 100% of indoor air in an urban 
pregnant woman cohort [47]. When pregnant rats are 
given chlorpyrifos at doses below those affecting ace¬ 
tylcholinesterase, their offspring have permanent defi¬ 
cits in learning and memory [25]. Work in the Slotkin 
laboratory has established persistent changes in seroto¬ 
nergic and dopaminergic systems in adult rats exposed 
either prenatally or neonatally to chlorpyrifos at doses 
that are devoid of systemic toxicity [48-50]. Further¬ 
more, in rats exposed neonatally on gestational days 
17-20 (during peak neurogenesis), there were sex- 
dependent behavioral impairments persistent to adult¬ 
hood [51]. Intriguingly, in addition to the neurotox- 
icological effects of chlorpyrifos, adverse effects on 
adenyl cyclase cellular signaling cascades were noted 
in heart and liver of developmentally exposed rats 

[52] . As concluded by Slotkin, “The mechanisms for 
neurodevelopmental effects from toxins such as chlor¬ 
pyrifos cannot be presumed to be based on known 
mechanisms for systemic toxicity in the adult” [25]. 
More recently, an epidemiologic study of inner-city 
children compared umbilical cord chlorpyrifos plasma 
concentrations and cognitive and motor development 
(at 12, 24, and 36 months of age) and behavior (at 
36 months). A significantly higher proportion of 
children from the “highly exposed” group showed 
signs of mental and motor delays and were more likely 
to score in the clinical range for attention problems 
compared to the lower exposure group at 3 years of age 

[53] . Further insight into the mechanisms of chlorpyr¬ 
ifos and other organophosphates at low doses may 
further tighten regulatory control of this class of 
chemicals and/or offer therapeutic or psychological 
interventions to remediate low-dose and developmen¬ 
tal exposures. 

Endocrine disrupting chemicals (EDCs) can change 
endocrine function and cause adverse effects at the level 
of the organism, its progeny and/or (sub) populations 
of organisms. There have been a number of research 
conferences since the initial Wingspread Conference in 
1991, spearheaded by Dr. Theo Colburn, to address the 
effects of EDCs. Germane to this chapter, in 2000, the 
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EPA and the National Institute of Environmental 
Health Sciences (NIEHS) held a peer review “aimed at 
evaluating the scientific evidence on reported low-dose 
effects and dose-response relationships for endocrine 
disrupting chemicals in mammalian species that per¬ 
tain to assessments of effects on human health” [27]. 
Nine EDCs, not including dioxins or phthalates esters, 
were considered. More recently, as reviewed by 
Vandenberg and coworkers [54], bisphenol A (BPA) 
individually has been a focus of expert analysis and 
regulatory review. 

Bisphenol A is a high-volume chemical used in 
polycarbonate plastic. Use of this chemical has recently 
received intense media scrutiny because of its occur¬ 
rence in consumer products such as food container 
liners and baby bottles and ubiquitous occurrence in 
human samples [55]. Based on relatively low in vitro 
binding affinity to the estrogen receptor compared to 
estradiol, BPA was initially considered a weak environ¬ 
mental estrogen. However, more recently lower-dose 
effects are being noted at concentrations below those 
needed for nuclear estrogen receptor binding [56]. 
A number of in vitro studies measuring BPA-mediated 
effects in cell types as diverse as adipose explants, and 
pituitary, prostate cancer, and pancreatic cells have 
reported U-shaped nonmonotonic dose-response 
curves (reviewed in [54]). For example, cell prolifera¬ 
tion was lower at both low and high doses of BPA 
compared to intermediate doses. In vivo, BPA- 
mediated effects on CD-I mouse uterine endpoints 
also indicated nonmonotonic dose-response curves 
for some endpoints (e.g., vaginal opening and uterine 
wet weight) [57]. Fetal exposure to low doses of BPA 
causes changes in tissue organization and histoarch- 
itecture of developing mammary gland in mice [58, 
59]. In males, BPA causes adverse effects on the male 
reproductive system and sex-related behaviors [60, 61] . 
Important observations revealed from BPA studies, 
with particular relevance to low-dose effects, are that 
experimental design and study controls are critical. For 
example, compounds present in animal feeds can mask 
treatment effects, and different animal strains can have 
different sensitivities [62]. The exact mechanisms 
underlying BPA low-dose effects are still under inves¬ 
tigation. It is important to remember that estrogen is 
a critical physiological mediator of many diverse tissue- 


dependent cellular processes. Furthermore, appropri¬ 
ate timing of sex steroid exposure is key in establishing 
the hypothalamus-pituitary-gonad feedback necessary 
for successful development and reproduction. Issues of 
timing of exposure would be expected to have pro¬ 
found impacts on effects of BPA. 

Conclusions from the 2000 peer review included 
that relevant low-dose effects of BPA included increased 
prostate weight in male mice and advanced puberty in 
female mice after in utero exposure. Effects in rats (e.g., 
uterine growth and serum prolactin) were strain 
specific. They concluded “Data are insufficient to 
establish the shape of the dose-response curve for 
BPA in the low dose region, and the mechanism and 
biological relevance of reported low dose effects are 
unclear” [27]. In 2006, the NIEHS organized an addi¬ 
tional meeting specifically addressing BPA. A product 
of this meeting was the Chapel Hill Consensus State¬ 
ment [63], which concluded “.. .that human exposure 
to BPA is within the range that is predicted to be 
biologically active in over 95% of people sampled. 
The wide range of adverse effects of low doses of BPA 
in laboratory animals exposed both during develop¬ 
ment and in adulthood is a great cause for concern with 
regard to the potential for similar adverse effects in 
humans.” Meanwhile, the National Toxicology Pro¬ 
gram’s Center for the Evaluation of Risks to Human 
Reproduction (CERHR) evaluated the scientific data 
and concluded that some of the previously attributed 
effects of BPA (e.g., changes in prostate weight, repro¬ 
ductive tract malformations etc.) were not conclusive 
[64] . This report was highly criticized (see review [54] ), 
and NTP released its own report finally concluding 
“there is some concern for effects on the brain, behavior, 
and prostate gland in fetuses, infants, and children at 
current human exposures” but “minimal concern for 
effects on the mammary gland and an earlier age for 
puberty for females” [65] . Importantly in August 2008, 
and in an interim update issued January 2010, the US 
Food and Drug Administration, which regulates food 
packaging etc., released a draft report finding that BPA 
remains safe in food contact materials [66]. Some of 
the questions plaguing the debates on BPA risk assess¬ 
ment include different routes of exposure and metab¬ 
olism of BPA in animals and humans, lack of 
consistency in results between laboratory studies, and 
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the relevancy of the animal models to the human con¬ 
dition. Clearly, this is a timely ongoing debate that 
absolutely hinges on elucidating the consequences of 
low-dose environmentally relevant exposure to BPA. 

Future Directions 

The mechanisms of toxicity and the few selected xeno- 
biotics used as examples here emphasize how timely 
and relevant the issue of effects at low doses is. Further¬ 
more, answers to the issues raised are key for more 
informed and appropriate risk assessment. Toxicology 
studies historically have been limited to testing concen¬ 
trations representing the high end of environmental 
relevance. The need to test higher doses was necessi¬ 
tated both by inability to measure/recognize subtle 
effects, and the need to achieve statistically significant 
results with the minimal number of test organisms and 
cost. Now with the realization that, in general, mecha¬ 
nisms of toxicity are not consistent from high to low 
doses, new robust experiments are needed to explain 
the more relevant low-dose effects. However, care must 
be taken so that the cost of regulatory measures that 
conservatively estimate the harm of low-dose effects is 
balanced with the cost of potential mechanisms that 
alter the response of current or future populations. 
Furthermore, when a biomarker of effect is recognized 
to not represent true risk of adverse effect (e.g., DNA 
adduct formation), then more appropriate endpoints 
(e.g., mutation) should be incorporated into risk 
assessment. As described above, many low-dose effects 
are critically linked to the developmental age at the 
time of exposure. Populations which differ in age or 
previous exposure are likely to have different responses 
to low-dose treatment of any contaminant. The World 
Health Organization has developed a multiple expo¬ 
sure/multiple effects childrens health model incorpo¬ 
rating the influence of social, economic, and 
demographic contexts on health outcomes and expo¬ 
sure in addition to the relationship between exposure 
and effect [67] . As the underlying system biology of test 
organisms becomes better understood, better evalua¬ 
tion of effects of xenobiotic exposure will be possible 
[ 1 ]. Finally, the field of risk assessment and society in 
general may ultimately have to deal with the ethical 
questions that arise associated with potential hormetic 
effects of chemicals that are toxic at high doses [31]. 
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Glossary 

Bioaccumulation factor (BAF) The concentration of 
contaminant in the organism divided by the con¬ 
centration of the contaminant from all exposure 
sources, including water, air, soil, sediment, or food. 
Bioconcentration factor (BCF) The concentration of 
contaminant in the organism divided by the con¬ 
centration of the contaminant in the water. 
Biomarkers A quantifiable change in a biochemical, 
physiological, or histological parameter that can be 
used as a measure of response to a toxicant. 
Biological survey A systemic gathering of information 
on the presence, abundance, and condition of species 
at a given site to assess the health of those organisms. 
EPT index The number of organisms in the orders 
Ephemeroptera (mayflies), Plecoptera (stonefiles), 
and Trichoptera (caddisflies). These orders are typ¬ 
ically considered to be sensitive to contaminants. 
LC 50 The median lethal concentration, or the concen¬ 
tration of the medium that was lethal to 50% of the 
test organisms. 

LOEC The lowest observed effect concentration, or the 
lowest concentration of test medium in which sta¬ 
tistically significant toxic effects are seen. 

NOEC No observed effect concentration, or the 
highest concentration of test medium in which no 
significant toxic effects are seen. 

Toxicity test An assay in which organisms are exposed 
to soil, water, or air, and the effects of contaminants 
on processes such as survival, growth, and repro¬ 
duction are determined. 

Toxicogenomics Examining changes in gene expres¬ 
sion, protein expression, and/or metabolite profiles 
in a response to an exposure by a toxicant. 

Definition of the Subject 

An ecological risk assessment attempts to determine 
the effects of contaminants or other environmental 
stressors on individuals, populations, and communi¬ 
ties of species. Although ecological risk assessments can 


be used for regulating chemicals and managing water¬ 
sheds, the most typical use is for the study and reme¬ 
diation of contaminated sites. The organismal effects 
that can be examined vary considerably, depending on 
the ultimate use of the assessment, but most often focus 
on the organisms’ ability to survive, grow and develop 
appropriately, and reproduce. The choice of which 
organisms to use should be based on easily evaluated 
criteria, such as the ease of sampling a particular organ¬ 
ism, ensuring that the organism can both respond to 
the toxicant or mixture of toxicants, and that the 
organismal response is graded with respect to increas¬ 
ing exposure concentrations and/or times. The specific 
endpoints measured in the risk assessment with respect 
to organisms can therefore include lethality, weight 
gain, population numbers, birth and death rates, alter¬ 
ations in community structure, among others. Some 
endpoints are easily quantified, such as lethality. Other 
endpoints may be much more difficult to measure, 
such as reproduction, development, productivity, and 
immune function. For these more subtle outcomes, 
one often uses biological markers to indirectly assess 
these endpoints, or surrogate species in the laboratory 
to find out more about the health of the population or 
community under study. Other methods to better 
refine the risk assessment include the use of models 
and the use of genomics. The underlying goal to con¬ 
duct an effective assessment relies on the choice of the 
organism and the endpoint examined so that manage¬ 
ment decision is appropriate and protective of organ¬ 
ismal health. 

Introduction 

The selection of which organisms to monitor or use is 
an important part of the risk assessment process. This 
entry first discusses the role of organismal selection and 
response in the basic framework of risk assessment. The 
criteria used when selecting an appropriate organism to 
use and the common organisms routinely used in 
assessments will be discussed. Laboratory toxicity 
tests, using either standard organisms as surrogate spe¬ 
cies or nontraditional organisms that are more repre¬ 
sentative of the site can be used. Field studies are also 
routinely conducted, which can monitor the presence 
and the abundance of a variety of species. Beyond 
typical field assessment, biotic indices are used as 
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measures of community structure, while biomarker 
data are used to assess sublethal impacts that often 
cannot be determined during standard laboratory tox¬ 
icity testing. Finally, the entry will cover future direc¬ 
tions in risk assessment, which may include the use of 
genomics, proteomics, other multi-end point analysis, 
and physiologically-based pharmacokinetic (PBPK) 
modeling. 

Animal Models in the Basic Framework of Risk 
Assessment 

Overview 

The basic framework for risk assessment consists of 
four phases: problem formulation, analysis, risk char¬ 
acterization, and risk management [1,2]. Much of this 
entry will focus on the first two phases, where the 
selection and analysis of organismal responses is most 
crucial. There is an additional step prior to the risk 
assessment itself, which is termed planning [3]. At the 
planning stage, the risk manager, risk assessors, and 
stakeholders should work together to identify the 
goals of the risk assessment. These goals should include 
developing clear objectives for what organisms the 
manager is most interested in protecting, determining 
what contaminants are most important to consider for 
the remediation effort, and indicating the acceptable 
levels of contaminants or the desired condition of the 
environment and organismal health after the remedia¬ 
tion is completed [4]. 

During the problem formulation stage, a concep¬ 
tual model should be developed to describe the rela¬ 
tionships between the contaminants at the site and the 
organisms that may be affected by them. Although 
many species can be found at a site, the risk assessment 
is only concerned with those that could be adversely 
affected. Additionally, since there are always factors 
that mitigate or exacerbate the concentration of chem¬ 
ical that an organism encounters, these should be taken 
into account during model development. The risk 
assessor should identify the key ecological characteris¬ 
tics that allow or prohibit certain species to thrive. 
Potential confounding factors should be identified, 
such as physical manipulations, building and construc¬ 
tion, other human disturbances, or competition by 


invasive species [4]. There is also the issue of indirect 
effects, in which the impact on a species affects another. 
Examples include a loss of a prey or predator species, or 
loss of vegetation that provides food and shelter. Once 
these factors are taken into account, then appropriate 
selection of the animal models to use in the risk assess¬ 
ment can occur. 

During the analysis stage, there is both a character¬ 
ization of the exposure and a characterization of the 
effects. Organisms may be sampled from the exposed 
site and changes due to the exposure determined. 
If suitable native species cannot be found or cannot 
be found in great enough numbers, outside organisms 
may be brought in and caged at the site to determine 
the amount of toxicant being taken into the organism 
and its resultant effects. More frequently, if a suitable 
field study cannot be performed on site, laboratory 
tests are conducted using surrogate species in standard¬ 
ized tests to determine the effects of contaminant 
exposure. 

After data collection, one must determine whether 
there is a relationship between the contaminants and 
effects on organisms. One useful way to categorize the 
strength of the evidence is based upon the Bradford- 
Hill criteria [5], which provide a basic framework for 
assessing the strength of causal relationships. Although 
Hill originally published these criteria for human epi¬ 
demiological work to determine the causes of diseases, 
they have been modified for ecotoxicology and risk 
assessments [6, 7]. The criteria or types of evidence 
a risk assessor is looking for to support a cause-effect 
relationship include: 

1. Spatial co-occurrence - the effect should be 
observed only where the contaminant is present. 

2. Temporality - there should be a cause-effect time 
relationship, in that the effect has to occur after 
the exposure to the contaminant. 

3. Biological Gradient - this criterion describes the 
classical exposure-response relationship, in which 
higher exposure concentrations should lead to 
a greater incidence or more severe effects if the 
effect is due to the contaminant. 

4. Complete Exposure Pathway - the contaminant has 
to be able to reach the organism in order to cause 
an effect. If the toxicant is in media that are 
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inaccessible to the organism of interest, than expo¬ 
sure will not occur. 

5. Consistency of Association - if similar observations 
are documented by multiple investigators in mul¬ 
tiple locations, this increases the likelihood that 
the effects seen are due to the contaminant. 

6. Experiment - if the exposure is manipulated or 
modified, this should cause a decrease in effects. 
The manipulation can be conducted by using site 
media in the laboratory and altering the contam¬ 
inant’s concentration by dilution, or by 
conducting field experiments in which the expo¬ 
sure can be modified, contained, or eliminated. 

7. Plausibility - this criterion asks whether the effects 
seen in the organisms can be explained because of 
a known or plausible biological mechanism. Addi¬ 
tionally, the risk assessor should understand 
whether the effects would be expected at the con¬ 
centration of contaminant seen at the site. 

8. Analogy- this criterion examines uses case studies 
from contaminants that are similar in structure to 
the suspected contaminant, and asks whether the 
expected effects be the same. 

9. Specificity of Cause - is the observed effect seen 
consistently associated with the contaminant or 
likely to only be caused by the contaminant. If 
so, the cause-effect relationship is quite strong. 

10. Predictive Performance - this criterion asks 
whether side effects occurred as a result of the 
exposure, and if so, could they have been predicted 
based upon the contaminant’s mode of action. 

The first four provide the strongest evidence, since 
they are directly based upon the site being assessed. The 
next six are often considered less strong, since the data 
for these criteria typically come from laboratory studies 
or other field-based studies, which may or may not be 
completely applicable for the site in question. However, 
they provide corroborating evidence that, when 
coupled with the first four, can support the risk char¬ 
acterization process. 

Monitoring Trends in the Environment 

Knowing the baseline levels of contaminants and their 
impact on the health of populations and communities 
can support or act as a precursor to a risk assessment. 


Baseline levels are found by monitoring trends in the 
environment. Although these are not risk assessments, 
they can provide valuable information to the risk asses¬ 
sor. The US EPA’s Environmental Monitoring and 
Assessment Program (EMAP) is one such large-scale 
endeavor that seeks to monitor sites in a particular 
region for many years, thus providing data on multiple 
spatial and temporal scales [8, 9]. EMAP data can 
include such information as contaminant levels, bacte¬ 
rial loads, and species distributions. Thus, the data can 
confirm trends in a particular area or provide baseline 
levels to serve as a reference site to aid risk assessors. 

Biological surveys may also provide valuable infor¬ 
mation to support a risk assessment. Biological sur¬ 
veys most often are conducted in aquatic systems and 
often have data bases that span multiple sites and 
years. They are typically used to determine if an 
aquatic system is meeting its designated aquatic life 
uses, but have also been used to evaluate the effective¬ 
ness of controls and restoration activities, and charac¬ 
terize regional biotic attributes of reference sites. 
These surveys determine the presence or abundance 
of species at a given point in time. Information may 
also be collected on the numbers of sensitive or toler¬ 
ant species, and may provide information on the over¬ 
all community composition. These data is often 
converted and used in as a biotic index, which assigns 
numbers to analyzed sites to indicate the level of 
impact that external factors have on the community 

[10] . The survey can provide baseline data on the 
health of a particular system or region, which can be 
used as a reference during the risk assessment to deter¬ 
mine whether the impacted site is similar or not. 

Ecological epidemiology is another method to 
monitor trends in the environment. This is often used 
prior to a risk assessment, when the sources of contam¬ 
ination are either unknown or due to nonpoint sources 

[11] . Most often, this endeavor is initiated after reports 
of a noticeable environmental effect, such as the mor¬ 
tality of a large number of individuals, an increase in 
the number of deformities, or the decline of a species. 
The goal of the epidemiological study is to try to form 
a causal chain from source to exposure to effects. Once 
the cause is identified, those parties involved can then 
formulate the problem and continue on with the tra¬ 
ditional risk assessment. 
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Considerations When Determining Which 
Organisms to Use for an Ecological Risk 
Assessment 

The basic purpose of the risk assessment is either to 
estimate the exposure to the consumers or to the end¬ 
point organisms, such as a carnivore higher up the food 
chain. Therefore, the relevance of the assessment 
heavily depends on choosing the appropriate organ¬ 
isms to monitor or study, or the appropriate species 
assemblages. There are certain criteria that are desirable 
when considering which organisms to use in the risk 
assessment [4]. First, the organism should be easily 
sampled. This suggests that the organism should be 
abundant in the area under study and amenable to 
capture and holding. Second, the organism should be 
sensitive to the contaminants in the system. If a partic¬ 
ular species is known to be insensitive to PAHs, for 
example, than selecting it to determine effects of creo¬ 
sote contamination will not be protective of the envi¬ 
ronment. Third, the organism should respond to the 
toxicant under question in a predictable manner. Ide¬ 
ally, one wants a gradual change in response when the 
contaminant levels are increased. Ideally, a threshold 
should exist in which contaminant concentrations 
below these levels should be of little concern because 
they will not cause adverse effects [4, 12]. 

Levels of Biological Organization 

There are different levels of biological organization that 
can be used in an ecological risk assessment: (1) mon¬ 
itoring or testing individual organisms; (2) testing or 
monitoring populations of a particular species; or 
(3) community level assessment (Table 1). The choice 
of level will ultimately depend upon a variety of factors, 
such as the types of organisms present, their abun¬ 
dance, and their types of interactions with one another. 
Using the assessment to investigate impacts on a rare 
species might necessitate only examining a few individ¬ 
ual organisms. However, if during the problem formu¬ 
lation step, there is concern about indirect effects, in 
which the impact on one species affects another, than 
population or community levels may be most appro¬ 
priate. For example, if the contaminant is thought to 
reduce levels of a prey species, then using a community 
level approach to determine predator-prey interactions 
might be in order. 
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Table 1 Levels of biological organization and typical end¬ 
points assessed during a risk assessment 


Organism 

Population 

Community 

Survival 

Survival 

Survival 

Growth 

Population 

density 

Population 

density 

Condition factor 

Reproductive 

success 

Community 

structure 

changes 

Reproductive 

success 

Productivity 

Trophic 

imbalances 

Tumors 

Changes in 

genetic 

structure 

Biomass 

Behavioral 

alterations 

Development 
of resistance 


Acetylcholinesterase 

inhibition 

Biomass 


Stress proteins 

Changes in metabolism 

Immune system alterations 


The spatial characteristics of the site and the specific 
agent or agents being assessed will clearly influence the 
choice of biological organization, as will any temporal 
variations in the exposure. For example, if one is doing 
an assessment involving an effluent stream released 
once per month, this will necessitate a different strategy 
for testing and monitoring than an assessment for 
remediation of a contaminated site in which the organ¬ 
isms are continuously exposed. Along with temporal 
variations, the concentration of the agents may make 
a difference in the choice of scale as well. An exposure 
that is extremely high in concentration and pulsatile in 
nature would warrant different tests than a low level, 
chronic exposure [3]. 

The first level of organization is to monitor or test 
individual organisms. The endpoints garnered from 
these tests are typically survival, growth, and/or repro¬ 
duction (Table 1). Depending on the organism used, 
these tests can take anywhere from hours to days. 
Population level tests typically are conducted via 
microcosms or mesocosms, using multiple organisms. 
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The endpoints assessed are usually survival and 
growth/production, which takes days to months to 
conduct. The effects on populations can also be deter¬ 
mined in the field by either a designed test or, more 
often, by simply monitoring the environment. Exam¬ 
ples of these types of investigations might include 
examining organisms in a lake or pond, or in gridded 
and staked terrestrial areas. Again, the endpoints often 
assessed are survival, growth, reproduction, and pro¬ 
duction, which may take days to years to conduct. 
Community assessments are conducted either using 
a mesocosm (a small community), field tests, or envi¬ 
ronmental monitoring. Beyond the typical endpoints 
of survival and growth, one can now examine changes 
in community structure and trophic balance. 


TD 

C 

o 

o 

o 


10 years - 
1 year - 
Months - 
Days - 
Hours - 





- 1 - 1 - 1 - 

Organisms Populations Communities 

Ecological Risk Assessment and Animal Models. Figure 1 

Time scale for collecting field or laboratory data to support 
the risk assessment 


E 


Exposure Time, Organismal Characteristics, 
and Exposure Routes 

Once the level of biological organization is decided 
upon, the next step is to determine the types organ¬ 
ism^) to use and how long to expose or monitor them. 
Influencing factors for the exposure time and type of 
organism include the type of media that are contami¬ 
nated (soil, sediment, water), the concentrations of the 
contaminants, and what types of organisms are present 
at the site, and how many of them are at the site. 

The length of exposure time for each of these tests 
depends on the agent and how it is released or occurs in 
the area. If assessing the effects of a pesticide applica¬ 
tion, one may only need to conduct the test or monitor 
the environment for a few days. If assessing the effects 
of an oil spill, monitoring for months to years might be 
more appropriate (Fig. 1). The timing of when the test 
is conducted also needs to take into account issues like 
seasonal variability. Water and food may be limited at 
certain times during the year, and thus the organisms’ 
exposure could be reduced. Additionally, breeding and 
migration times may pose issues for accurate monitor¬ 
ing and should be taken into account before beginning 
the study. 

Most risk assessments focus on a limited number of 
organisms - typically no more than just a few species or 
a small community, if looking at invertebrates or algae. 
The choice of organisms to examine therefore becomes 
critical. Among the specific factors and organismal 
characteristics that should be taken into consideration 


is the mobility of the organism. Ideally, one would want 
to monitor an organism with limited mobility, such as 
earthworms or invertebrates, or to monitor those 
organisms with low mobility at some stage in their 
life cycle [12]. Another concern is the sensitivity of 
species toward the contaminant of interest. Ideally, 
one would want to use the most sensitive species, 
which would be protective of all other species. How¬ 
ever, given that most sites have a suite of contaminants, 
finding the species most sensitive to the mixture maybe 
difficult. A third consideration is the ecological func¬ 
tion or role of the organism. Often, it is good to choose 
at least one primary producers to study because of their 
large contribution to the food chain. Other factors to 
take into account are choosing an organism that is easy 
to collect and choosing an organism of high relative 
abundance. Finally, one also needs to be concerned 
with a species that has a certain recognized value, either 
economic or symbolic, as the one that you want to 
protect. It may be necessary to monitor these types of 
species, even if they do not fit the other criteria for 
choice of an organism to use [12]. 

The last issue is to determine the most likely route 
of uptake, be it oral, dermal, or inhalation (for a recent 
review, see [13]). The oral route is the most likely 
exposure route (Table 2). However, organisms can 
also swim in water, absorb contaminants through der¬ 
mal contact, and take up compounds via inhalation, so 
that depending upon the contaminants at the site, these 
minor routes of uptake may be more important. For 
terrestrial invertebrates, the most common type of 
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Table 2 Routes of uptake in different organisms 


Organism 

Common uptake routes 

Terrestrial invertebrates 

Oral, dermal 

Aquatic invertebrates 

Oral, respiratory surfaces 

Fish 

Oral, gills, dermal 

Mammals 

Oral >> dermal, inhalation 

Birds 

Oral, inhalation 

Amphibians 

Dermal, oral 

Reptiles 

Oral 


Routes listed are in the order of importance for each class of 
organism 


organism to monitor is the earthworm. For these ani¬ 
mals, exposure occurs either dermally or orally. Other 
arthropods can be examined, but these typically have an 
exoskeleton, so a food-borne exposure is more likely 
than dermal exposure. Aquatic invertebrates are most 
likely exposed through the oral route. Rather than 
conducting a test or monitoring these organisms, it is 
sometimes more desirable to use water or sediment 
bioaccumulation factors (BAFs) to estimate exposure. 
The uptake route of fish is typically oral but can also be 
via the gills or as a dermal exposure. As with aquatic 
invertebrates, one sometimes estimates the exposure to 
fish using BAFs or bioconcentration factors (BCFs). 
BAFs tend to be more consistent across different water 
chemistries and different organisms. They are also con¬ 
sidered more realistic of an exposure since diet and food 
chain multipliers are included within the BAF. BCFs are 
determined in laboratory-based studies and are consid¬ 
ered more precise. Therefore, a BCF is often what is used 
to estimate exposure, if available [3]. Exposure 
factors for many organisms can be found in the Wildlife 
Exposure Factors Handbook [14]. 

For most terrestrial organisms, the oral uptake 
route tends to be most important (for a recent review, 
see [13]). In mammals, the uptake of contaminants 
tends to be through the diet, while the dermal and 
inhalation routes tend to be negligible. Mammals 
have extremely varied feeding strategies, both in terms 
of prey preference and in feeding frequency. This can 
result in high exposures to some mammals while no 
exposure to others, so care must be taken in the risk 


assessment to choose a species that will come into 
contact with the contaminants of concern. Periods of 
fasting, hibernation, and migration can mobilize stored 
contaminants, resulting in toxicity without changing 
the body burden. Additionally, lacatational and placen¬ 
tal transfer may occur from mothers to the developing 
offspring. In birds, the oral route is the most important 
uptake pathway, most commonly through the inges¬ 
tion of contaminated prey items. During pesticide 
exposure, uptake via inhalation may contribute signif¬ 
icantly to their body burdens. The transfer of contam¬ 
inants from gravid females into their eggs may also 
represent a significant exposure source to embryos. In 
reptiles, the oral route is most important via ingestion 
of contaminated food items. The skin tends to inhibit 
some compounds from passing through dermally. 
As with avian species, contaminant transfer into 
eggs also represents an important exposure route. 
In contrast, the dermal route may be extremely impor¬ 
tant to amphibians due to their life cycle, which is spent 
in contact with both water and soil for parts of their 
life span. 

When quantifying how much contaminant the 
organism had received during the exposure, one can 
determine its concentration in the whole body of the 
organism, its burden in an organ in which the toxicant 
is likely to exert a toxic effect, such as the liver, kidney, 
or brain, or in organs of accumulation, such as fat, hair, 
or feathers. As stated above, for aquatic organisms, 
BAFs or BCFs are often used to estimate exposure. 

Field Studies and Toxicity Testing 

Toxicity tests involve collecting contaminated soil, sed¬ 
iment, or water from the site, and then exposing organ¬ 
isms to that medium or those media in a laboratory 
setting. There are several different advantages of toxic¬ 
ity tests over field studies. First, a toxicity test can 
evaluate whether the contaminant was bioavailable to 
the organisms. Second, a toxicity test can enable a more 
informed assessment of the impact of mixtures of com¬ 
pounds than the use of data in the literature on single 
compounds. They can determine whether characteris¬ 
tics of the medium itself, such as hardness or pH, are 
influencing toxicity. Third, they can examine sub¬ 
stances whose effects on wildlife are not well under¬ 
stood. Fourth, they are typically used for monitoring 
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to determine whether the risk mitigation strategy is 
working [2, 15]. 

Although a variety of endpoints can be measured 
from toxicity tests, the most common ones are survival, 
growth, and reproduction. Survival is most often 
assessed using a short term, acute toxicity test. An 
acute test typically lasts 24-96 h, during which the 
organisms are exposed to a medium from the site, 
plus a reference medium, and the number of deaths is 
recorded. Outcomes of this test include the medium 
lethal concentration or LC 50 , which is the concentra¬ 
tion of the medium that killed 50% of the test organ¬ 
isms. Other values that are typically obtained are the 
lowest observed effect concentration (LOEC), which is 
the lowest concentration of test medium in which sta¬ 
tistically significant toxic effects are seen, and the no 
observed effect concentration (NOEC), which is the 
highest concentration of test medium in which no 
significant toxic effects are seen. 

Sublethal endpoints, such as growth and reproduc¬ 
tion, are best measured using a chronic toxicity test. 
These tests take a longer period of time to conduct, 
because they encompass either a critical developmental 
phase or the organism’s entire life cycle, but give better 
resolution as to whether a toxic effect is occurring on 
site. Similar to an acute test, the values obtained are 
typically a median effective concentration (EC 50 ), or 
the concentration of medium that caused effects, such 
as reduced growth or reproduction, on 50% of the 
organisms. A LOEC and NOEC are typically calculated 
for chronic tests. 

Generally, values such as the LC 50 , NOEC, and 
LOEC for a single contaminant using standardized 
test organisms can be easily be obtained from the 
literature or using databases such as the EPA’s ECOTOX 
database (http://cfpub.epa.gov/ecotox/) or the USGS’s 
Contaminant Exposure and Effects-Terrestrial Verte¬ 
brates database (CEE-TV) (http://www.pwrc.usgs.gov/ 
contaminants - online/p ages/ceetv/ ceetvintro .htm ). This 
means that oftentimes, a de novo test may not be 
necessary. However, one should consider conducting 
a toxicity test if the data for the compound or interest is 
absent in the literature, if there is reason to suspect that 
conditions at the site, such as pH or organic matter, 
may be influencing bioavailability or toxicity to organ¬ 
isms, or if the interactions between mixtures of com¬ 
pounds are considered significant. 


Standard Toxicity Tests 

When the risk assessor decides that a de novo toxicity 
test is needed, there are several elements that need to 
be decided upon. First, the assessor should decide 
upon the level of effort needed to achieve the goals 
of the study. If he or she only wants to know 
whether the medium is toxic or not, using two test 
organisms and collecting medium from just a few 
sites may be sufficient to obtain the desired infor¬ 
mation. However, to fully characterize the site’s tox¬ 
icity, the study might use test organisms at different 
trophic levels, a number of sampling locations, and 
several dilutions of each test medium [15]. Second, 
a reference site should be included that is similar in 
characteristics to the site under study and is located 
as close to the study site as possible. Third, the test 
medium must be decided upon. Although it is most 
typical to use the water, sediment, or soil directly 
from the site, one can use soil or sediment elutriate 
to expose the test organisms. Finally, the test organ¬ 
isms themselves need to be selected. Whenever pos¬ 
sible, a standardized organism should be used that is 
representative of the onsite organisms and is sensi¬ 
tive to the contaminant of concern [15, 16]. 

Several types of acute aquatic tests involving fish, 
macroinvertebrates, mysids, Daphnia, and algae have 
been developed (for a good overview, see [ 17] ). Most of 
these are 48-96 h LC 50 tests that measure survival as the 
endpoint (Table 3). Chronic tests generally involve 
either early life stage tests with juveniles, but there are 
some partial or full life cycle tests. Most of these mea¬ 
sure endpoints such as survival, growth, and reproduc¬ 
tion (Table 3). 

Nonstandard Tests and Field Tests 

If a laboratory test is needed to assess site medium, the 
risk assessor should try to choose a standardized test 
and a standardized organism (Table 3). This is not 
always possible, as the biota in a particular site may 
not be similar enough to a standardized organism, or 
the standardized organisms are not sensitive to the 
contaminant at the site. In these cases, using an alter¬ 
native species is completely appropriate [2, 15]. Addi¬ 
tionally, field studies are routinely conducted at the 
actual site. These studies are more concerned with 
effects on populations and communities, rather than 
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Test 

Organism 

Test duration 

Endpoints 

Reference 

Freshwater acute 

Daphnia magna or Daphnia pulex 

48-96 h 

Survival 

[18, 19] 


Fathead minnow 

96 h 

Survival 

[18, 19] 


Rainbow trout 

96 h 

Survival 

[18, 19] 


Brook trout 

96 h 

Survival 

[18, 19] 

Freshwater chronic 

Cerodophnia dubia 

7 days 

Survival, 

reproduction 

[20] 


Daphnia magna 

21 days 

Survival, 

reproduction 

[21] 


Fathead minnow larvae 

7 days 

Survival, growth 

[22] 


Algae ( Selenastrum capricornutum) 

96 h 

Cell density 

[23] 

Saltwater acute 

Bivalve mollusk larvae 

48 h 

Abnormal shell 
development 

[24] 


Silverside (Menidia sp) 

48 h 

Survival 

[18, 19] 


Sheepshead minnow ( Cyprinodon 
variegates ) 

48 h 

Survival 

[18, 19] 

Saltwater chronic 

Silverside larvae 

7 days 

Survival, growth 

[18, 19] 


Sheepshead minnow 

7 days 

Survival, growth 

[18, 19] 


Mysid shrimp 

7 days 

Survival, growth, 
reproduction 

[25] 


Sea urchin (Arbacia punctulata) 

20 min 

Egg fertilization 

[25] 


Algae ( Champia parvula) 

5-7 days 

Fertilization 

[26] 

Freshwater sediment chronic 

Hyalella azteca 

10-30 days 

Survival, growth, 
reproduction 

[26] 


Midge (Chironomid tetans or C. riparius) 

10-14 days 

Survival, growth 

[26] 

Marine sediment chronic 

Amphipod 

10 days 

Burrowing 

[25] 

Terrestrial 

Earthworm ( Eisenia foetida ) 

14 days 

Survival 

[27] 


Bobwhite quail 

8 days 

Survival, growth 

[28] 


Mallard duck 

8 days 

Survival, growth 

[28] 


on individual organisms [ 12] . Reasons for doing a field 
study might include the fact that a current or future 
effect is likely, but there is not enough information to 
support a management decision, or it can provide 
additional evidence to link a site’s contaminant and 
adverse effects. The key to conducting a good 
nonstandardized laboratory test or a field test is to 
find an appropriate reference site. 


Studies of sites in which the contaminants of con¬ 
cern are found in the water should include the exami¬ 
nation of the effects on periphyton, plankton, benthic 
macroinvertebrates, or fish [12]. Periphyton and 
plankton, such as algae, protozoa, and some crusta¬ 
ceans, are primary producers. Periphyton is typically 
collected by scraping or suction, or an artificial sub¬ 
strate may be deployed for a number of days to allow 
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attachment. Plankton is collected by pumping, netting, 
or trapping. Endpoints examined usually involve mea¬ 
sures of community composition, such as species num¬ 
bers, diversity, and richness. 

Benthic macroinvertebrates are indicative of the 
effects of sediment contamination and are typically 
a good choice to examine if there are lipophilic con¬ 
taminants at the site. They are collected by digging, 
netting, dredging, or grab sampling. Endpoints mea¬ 
sured include the number and types of species, and 
other measures of species diversity. Macroinvertebrates 
can also easily be used in laboratory sediment toxicity 
tests, using either spiked artificial sediments or sedi¬ 
ments taken from the site. Fish are good to examine 
because they can occupy different trophic levels, and 
they are relatively easy to collect by seining, passive 
netting, trawling, or electrofishing. Endpoints exam¬ 
ined are often numbers and diversity, or sublethal 
effects such as behavior or tumor burden. Like benthic 
macroinvertebrates, fish are often used in laboratory 
toxicity tests. 

Studies of sites in which the contaminants of con¬ 
cern are found in the soil often involve examination of 
effects on earthworms, nematodes, amphibians, mam¬ 
mals, or reptiles [29]. Earthworms and other soil nem¬ 
atodes are good for both metals and organics since they 
are in constant contact with the soil. Nematodes are 
collected by coring, sieving, or they can be driven from 
the soil by heat. Worms can be tested using spiked soil 
as the medium, or are examined directly on site, with 
typical endpoints being survival and reproduction 
[27]. Organisms such as amphibians, reptiles, and 
mammals are not used or collected as frequently. 
When they are, the risk assessor generally chooses 
a small species, since its home range is likely to be 
smaller and it tends to be more abundant than larger 
species. These types of organisms are generally col¬ 
lected by trapping, or their presence can simply be 
determined by examining surrogate measurements 
such as footprints, droppings, or carcasses [12]. Birds 
are not typically used or examined in a risk assessment 
because their home ranges can be quite large and their 
collection tends to be difficult and time intensive. 
When they are examined, they can be trapped to deter¬ 
mine their numbers, or surrogate measures, such as 
auditory signals, used to determine their presence. 


Alternatively, the number of nests can be counted and 
the success of each nest can be documented. 

Other Assays Providing Support to Risk 
Assessments 

Beyond using standard toxicity tests to assess survival, 
growth, and reproduction, or looking for the presence 
of an organism during a field survey, there are several 
different ways to determine other types of adverse 
impacts, such as altered community composition, to 
estimate body burdens of toxicants, and to examine 
markers or exposure and/or effects. 

Biological Surveys 

Two such methods to examine altered community 
composition are biotic indices and rapid bioassessment 
protocols. These two types of surveys compare popu¬ 
lation and community compositions of macroinver¬ 
tebrates, fish, or periphyton between a reference site 
and the site of interest. For both of these protocols, an 
initial survey is conducted to quantify the presence and 
abundance of different species at both sites. These 
quantities are then converted into numerical values 
that relate to the overall health of the community. 

More than 50 different indices have been developed 
to assess community health [10]. Parameters examined 
include species richness, or the number of species in 
a community, density, dominance, diversity (the abun¬ 
dance of individuals in one taxon versus all other indi¬ 
viduals), as well as the number of sensitive species, 
such as the number of species belonging to the 
Ephemeroptera (mayflies), Plecoptera (stonefiles), 
and Trichoptera (caddisflies) or the EPT index [30]. 
Other metrics include percent dominant taxon, percent 
tolerant taxa, and percent Chironomidae, all of which 
are predicted to increase in response to increased per¬ 
turbation [10]. The Family Biotic Index is also a good 
measure of perturbation, as it uses tolerance values to 
weight abundance in an estimate of overall pollution 
[10]. Other indices such as the Simpsons Diversity 
Index and the Shannon-Wiener Diversity Index are 
also used to account for the richness and the evenness 
of the different species that make up the total commu¬ 
nity [31]. Finally, determining the percentage of 
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individuals in each of the five functional feeding groups 
can be used to determine whether the trophic structure 
is balanced [32]. 

Since many of this indices require a lot of time, 
expertise, and money, rapid bioassessment protocols 
(RBPs) have been developed since the 1980s to obtain 
these same measurements in a more cost effective and 
efficient way. These protocols were developed from 
previously existing bioassessment methods that were 
used by many state agencies. The RBPs examine other 
parameters besides numbers of organisms. In the 
case of streams, these include the available cover, 
embeddedness, velocity and depth of the river or 
stream, sediment deposition, channel flow status, 
channel alteration, frequency of riffles, bank stability, 
vegetative protection, and riparian vegetative zone 
width [10]. RBPs are appropriate for distinguishing 
the existence and severity of impairment of the water 
body, as well as identifying the causes of impairment. 
They are also used to determine if a stream is 
supporting a designated aquatic life use specified 
in state Water Quality Standards, evaluating the 
effectiveness of controls and various restoration 
activities, and characterizing regional reference con¬ 
ditions. Additional information on both types of 
bioassessments and their uses is provided in another 
entry. 

Biomarkers and Toxicokinetic Models 

A second method to estimate exposure of an organism 
to a contaminant is to use a biomarker. Biomarkers are 
defined as a quantifiable change in a biochemical, phys¬ 
iological, or histological parameter that can be used as 
a measure of response to a xenobiotic. The idea behind 
biomarkers is that they can detect more subtle adverse 
effects than lethality at lower chemical concentrations, 
which would better protect the population or commu¬ 
nity. Biomarkers can be indicators of either exposure or 
effects, although exposure biomarkers are much more 
common. Examples of biomarkers routinely used in 
environmental toxicology studies include the induc¬ 
tion of cytochrome P4501A (CYP1A) to indicate PAH 
exposure, acetylcholinesterase activity to indicate 
organophosphate or carbamate pesticide exposure, 
metallothionein induction to indicate exposure to sev¬ 
eral metals, vitellogenin to indicate exposure to 


estrogenic compounds, and eggshell thinning to indi¬ 
cate organochlorine exposure. Additional information 
on biomarkers is provided in another entry. 

A final strategy is to incorporate physiologically- 
based pharmacokinetic (PBPK) models into the risk 
assessment. These model the time course of disposition 
of toxicants in the whole organism, which gives an 
estimate of body burdens. In a PBPK model, the organ¬ 
ism is represented by several different compartments in 
which toxicants might be expected to be absorbed in, 
distributed to, biotransformed in, and excreted from, 
such as the alimentary canal, blood, fat, liver, and 
kidney. Thus, one can use the model to predict con¬ 
centrations of contaminants in an organism, rather 
than having to measure actual body burdens of con¬ 
taminants, which can be time consuming and 
extremely expensive. Beyond cost, the models have 
other advantages, such as being able to provide a time 
course of distribution of toxicants to any organ or 
tissue, being able to estimate effects by changing phys¬ 
iological parameters such as food consumption, mobil¬ 
ity, or reproductive status, and being able to use the 
same model to predict toxic action in different species. 
Although PBPK models can improve estimates in 
determining whether the contaminant will accumulate 
to high enough levels in the organism of interest to 
cause an adverse effect, there are a number of distinct 
disadvantages to their use compared to simply measur¬ 
ing body burdens. First and foremost, one needs infor¬ 
mation to build the model, such as blood volume, heart 
rate, breathing rate, and organ volumes. The fact that 
these parameters are often ill-defined for many species 
detracts from the power and utility of the model. Sec¬ 
ond, the mathematics, computational tools, and train¬ 
ing needed to build and run the models may be out of 
reach for many risk assessors. 

In spite of these obstacles, there are several PBPK 
models available for wildlife species, such as fish and 
polar bears, which can describe the body and tissue 
burdens of contaminants. For example, fish PBPK 
models have been built for rainbow trout and have 
been used to estimate tissue concentrations of 
paraoxon and cadmium [33-35]. Fikewise, there is 
a PBPK model that estimates arsenic concentrations 
in the different organs of tilapia (Oreochromis 
mossambicus ) [36]. Additionally, there are also 

models to assess ionizing radiation doses to a variety 
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of non-human species [37] . The models for polar bears 
are a bit more sophisticated than those built for fish, in 
that they can estimate tissue concentrations of 
organohalogenated compounds, such as dieldrin and 
PCBs, but can also be used to estimate risk of effects, 
such as adverse impacts on reproduction [38]. It is 
likely that they can be extended to other at-risk 
populations, such as orcas and the arctic fox [39]. 

There are a few models in reptiles and birds that 
were developed to be more applicable to veterinary 
uses, but could be used to estimate contaminant con¬ 
centrations in wildlife. Examples include a model 
developed in red-eared sliders (Trachemys scripta 
elegans ) to determine the pharmacokinetics and tissue 
distribution of buprenorphine, a potential analgesic 
drug for reptiles [40]; a model for midazolam devel¬ 
oped in chickens, but able to accurately predict tissue 
residues in turkey, pheasant, and quail [41]; and 
a model used to determine tissue levels of lipophilic 
pesticides in chickens, ducks, geese, and turkeys [42]. 
Alternatively, there are many rodent models for envi¬ 
ronmental contaminants that could potentially be 
adapted for use in other species. One such recent 
example includes a PBPK model in rats for develop¬ 
mental exposure to polybrominated diphenyl ethers 
(PBDEs) [43]. 

Future Directions Using Animal Models in Risk 
Assessments 

Some future directions for increasing the quality of risk 
assessment is to refine our understanding of sublethal 
effects rather than focusing on survival as an endpoint, 
and to better determine the mode of action of contam¬ 
inants. One potential way to accomplish this goal is 
through the use of genomics - analyzing changes in 
gene, protein, and metabolite expression. Genomics 
has the advantage of examining thousands of genes or 
proteins simultaneously as opposed to traditional bio¬ 
markers, which often measure a single gene or protein 
at a time. Like biomarkers, genomic changes can be 
linked to both exposure and effect outcomes. For 
example, rather than looking for reproductive failure 
in a toxicity test, one could examine changes in genes 
involved in steroidogenesis. These genes should be 
differentially expressed at much lower toxicant concen¬ 
trations, yet can still provide a predictive measure that 


adverse reproductive effects are likely to occur [44]. 
Thus, genomic techniques are more sensitive than tra¬ 
ditional toxicity testing or biomonitoring and can be 
used to detect early indicators of environmental 
perturbation. 

One of the issues encountered when using genomic 
data in a risk assessment is that baselines or control 
patterns of expression need to be established for 
a variety of species. This is true for those organisms 
routinely used in risk assessments, such as fathead 
minnows and Daphnia, but even more so for 
nontraditional species and those organisms collected 
in field studies, since the variation in natural 
populations can be quite significant [45]. Second, the 
data generated from a genomic study cannot be easily 
translated into terms familiar to many risk assessors, 
such as a NOEL or EC 50 [46, 47]. Some recent studies 
have developed no observed transcriptional effect levels 
(NOTEL) for Daphnia magna exposed to copper, and 
these NOTELs were validated after exposing Daphnia 
to water samples collected upstream and downstream 
from two copper mines in California [48, 49]. How¬ 
ever, there are almost no published studies using 
a NOTEL and relating that value to an adverse event. 
Third, there is often a lack of phenotype associated with 
the changes in mRNA levels, making it hard to interpret 
just what types of adverse effects would be occurring to 
organisms at the site [46, 47, 50]. Studies linking 
adverse effects, such as altered growth and reproduc¬ 
tion, with transcriptomic changes have also been 
conducted (reviewed in [51], but there are too few of 
these published to support the current use of genomics 
in risk assessments. Finally, genomic studies are expen¬ 
sive and the computational needs are quite high. 

The current policy at EPA is that toxicogenomics 
data may be used in a weight-of-evidence approach for 
ecological risk assessment, but must be provided in 
conjunction with other information. At this point, 
genomics data alone are insufficient for a risk assess¬ 
ment and any subsequent management decision 
[52, 53]. However, some potential uses for genomics 
in risk assessment for the foreseeable future are to guide 
the experimental design of monitoring programs so as 
to reduce the number of animals used, as an initial 
screening to rank or set priorities for further toxicity 
testing, and to help extrapolate the effects of contam¬ 
inants across species [54]. 
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Glossary 

Abstract community A group of organisms that 
recurs on the landscape with an implication of 
a level of integration among its parts that in extreme 
could be called organismal or quasi-organismal 
(see: Concrete Community). 

Chronosequence A space-for-time substitution in 
which the successional vegetation is ordered in 
a regular fashion. An example would be the series 
of vegetation at the foot of a receding glacier or 
a series of sand dunes ordered in regularly aged 
series. 

Clementsian succession An explanation of succession 
emphasizing the attributes of the community as if it 
functioned like a single living organism 
(see: Abstract Community). 

Climax community An ecological community asso¬ 
ciated with a particular climate and in a state of 
dynamic equilibrium with the climate. The Ameri¬ 
can ecologist, F.E. Clements, designated the climax 
community as the endpoint of ecological succession 
for a given climate condition and noted the com¬ 
mon Greek root /cAi^a ( clima) or inclination, in 
both climax and climate as indicating their close 
relationship. 

Concrete community The collection of organisms 
found at a specific place and time (see: Abstract 
Community). 


Gleasonian succession theory An explanation of suc¬ 
cession emphasizing the importance of the attri¬ 
butes of individual organisms as the fundamental 
basis. 

Indicator species Plants or animals whose presence 
implies the past or present conditions at a given 
location. 

Monoclimax The concept that one mature vegetation 
will eventually be produced by successional 
processes in a given region. 

Polyclimax The concept that multiple mature, stable- 
vegetation types can develop from the successional 
processes operating in a given region. 

SIGMATIST A school of vegetation science emphasiz¬ 
ing the classification of vegetation. 

Time-for-space substitution The collection of vege¬ 
tation data from different locations at which suc¬ 
cession has been initiated at different times in the 
past to piece together the pattern of succession. 

Definition of the Subject and Its Importance 

“Ecological Succession” is an ordered progression of 
structural and compositional changes in communities 
toward an eventual unchanging condition, the climax 
community [1-3]. The term “Community” is used in 
two ways [4]. The “Abstract Community” refers to an 
abstract group of organisms that recurs on the land¬ 
scape, a definition, which usually carries with it an 
implication of a level of integration among its parts 
that in extreme could be called organismal or quasi- 
organismal; the “Concrete Community” concept refers 
to the collection of organisms found at a specific place 
and time. These terms and their meanings are topics of 
significant debate among ecologists, both historically 
and today [5, 6]. These differences in the meaning and 
cause of ecological succession strongly affect the for¬ 
mulation of policies for ecosystems management and 
restoration. 

Succession is important in sustainability science for 
several reasons: Many of the laws and regulations 
regarding the maintenance and restoration of terres¬ 
trial ecosystems contain assumptions that are products 
of different theories as to how ecological succession 
proceeds in a given environment. Laws might require 
restoration to a natural condition, which is often the 
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vegetation that is considered the stable single vegeta¬ 
tion type on a map that is itself based on the idea that 
vegetation undergoes succession a vegetation type 
appropriate to the climate (the climax vegetation). 
Further, the management of land systems often has as 
their basis concepts as to how succession works. For 
example, if one subscribes to a successional theory that 
abused land under an inappropriate management 
scheme will return to its natural state if the inappro¬ 
priate management is relaxed, then one can utilize land 
until monitoring indicates the ecosystem is over 
stressed. If an alternate successional theory is applied, 
that abused land may not recover to its former when 
released from inappropriate management, then a much 
more cautious form of management is needed. 

Introduction 

Ecological succession and the dynamic change of com¬ 
munities is one of the pillars of modern ecology and it 
has considerable influence on sustainability science. 
The processes that drive succession and the regularly 
of patterns of succession have been the topic of consid¬ 
erable debate over the past century - due in no small 
part to the importance of the concept to basic and 
applied ecology. Succession is at its simplest level the 
pattern of change in ecological communities and these 
changes can be read on the landscape if one knows 
regional history or conversely can be used to interpret 
a landscape’s history. In the former case, the historical 
pattern of land abandonment in the piedmont of the 
southeastern United States presents land elements 
abandoned and never reclaimed since the US Civil 
war in 1860, land left fallow in the First World War; 
farms left to ruin in the crushing economic Great 
Depression of the 1930s; more land left when the sons 
of farmers left for the Second World War; still other 
farm land taken out of production in agricultural pro¬ 
grams reduce farm acreage in the 1950s. The landscape 
as one drives and automobile through Georgia or the 
Carolinas, is a kaleidoscopic montage with similar par¬ 
cels of land cover types replicated on locations with 
similar history. In the latter case, a small patch of pine 
(Pinus sp.) trees on a slope bears witness to recovery 
from a past wildfire or a Beech ( Vagus ) forest of great 
stature indicates an area that has not burned, perhaps 


for centuries. The dependence of pine species on fire for 
regeneration or the sensitivity of the thin-barked beech 
to fire is written into the landscape response. 

At the turn of the twentieth century, an American 
plant ecologist, F.E. Clements, framed many of the 
definitions and concepts in ecological succession and 
produced a theoretical system of concepts that 
explained the nature and function of ecological com¬ 
munities changing over time. Clements’ definitions 
and approaches to understanding the dynamics of 
landscapes were used to develop regional systems 
of land management policies based on expectations, 
succession, and successional recovery. Clements also 
developed a niche-based concept of using what he 
called “indicator species” to evaluate the history and 
condition of land units [2]. For example, an 
overabundance of bitter or poisonous herbs might be 
indicators of overgrazing in a prairie systems; the 
occurrence of other species might indicate unusual 
soil chemistry and thus the presence of valuable min¬ 
erals in the bedrock. The details of Clements’ ideas were 
debated strongly by his contemporaries and other ecol¬ 
ogists in the 1920s and 1930s found differences with 
parts of his successional concepts. For example, 
H.A. Gleason [7, 8] argued that the highly coordinated 
interactions in Clements concepts had at their basis the 
unique attributes of the individual plant species. These 
will be discussed below. Perhaps the most sharply 
pointed criticism was from the British ecologist, 
A.G. Tansley, who argued that the Clements’ view of 
succession had so clouded the meaning of the term, 
community, that a new term, the ecosystem [5], was 
needed to replace and properly formalize these ideas. It 
is significant in understanding the emphases in ecosys¬ 
tem studies to note that A.G. Tansley was a coauthor to 
the 1899 edition of Spencer’s Principles of Biology, the 
leading English biology text of the late nineteenth cen¬ 
tury. Tansley’s definition of the ecosystem and his lack 
of acceptance of Clements’ and Phillips’ holistic view 
that the ecological community was a coherent, organ- 
ism-like entity was not a rejection of a holistic view of 
ecology, it was a debate among holistic ecologists [9]. 
These and other continuing and sometimes rancorous 
[6, 10] debates will be discussed in more detail below. 

The development of the attributes of ecological 
succession has from its beginning had two poles: one 
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Clementsian pole, which emphasizes holism, great 
interconnectedness of ecological systems and a 
successional progression toward an ultimate stable eco¬ 
logical system with an optimization of its attributes; 
a second Gleasonian pole, which sees the properties of 
ecological systems as deriving from the individual 
attributes of the organisms that comprise it and their 
interactions with their environment. The Gleasonian 
view is less dogmatic about the necessary outcomes of 
succession. Of course, sustainability science ultimately 
must take the measure of the dynamics of complex 
ecosystems, the natural and direction of their change 
under management. Debates among ecologists as to the 
nature of succession must be understood by scientists 
seeking to understand sustainability. 

It is important to note that through the history of 
this strong Anglo-American debate on the nature 
of succession, there has been a continental European 
effort toward the goal of developing a taxonomic 
tradition in the issue of classifying vegetation into mean¬ 
ingful units, which is also an essential element of sus¬ 
tainable management. The so-called Zurich-Montpellier 
school (or as later called the SIGMATIST school) orig¬ 
inated in the 1890s with work by C. Schroter and Charles 
Flahault. This effort was taken up and lead by J. Braun- 
Blanquet. This tradition emphasized a floristic descrip¬ 
tion of vegetation and had the ultimate goal 
a classification of “associations” as the basic vegetation 
units of interest [11]. This interest in vegetation classi¬ 
fication as a basis of mapping and prediction continues 
to the present. Significantly, vegetation classification is 
an ultimate element in organizing management plans 
and hence for sustainability as well. 

Historical Development of the Concept of Ecological 
Succession 

Different scientists in different locations have observed 
these patterns and drawn rather different conclusions 
about the root causes of succession and to the degree 
that it repeats itself over time and space. Figure 1 illus¬ 
trates the relationships among several ecologists who 
have developed what might be best described as 
“iconic” views of succession. The two axes in the figure 
are the level of holism in the theories and the emphasis 
on the degree of importance of feedbacks between the 
biological community and the environment. 


Increasing Level of Holism 



Ecological Succession and Community Dynamics. 

Figure 1 

Iconic ecologists and their emphases on the attributes of 
successional systems 

All of these scientists are holistic in that they 
focused on the larger system behavior of ecological 
systems. The most holistic of these scientists would 
see the community as a highly organized, perhaps 
highly coevolved systems described as an abstract com¬ 
munity in the definition section of this paper (above). 
They also vary on the degree to which they emphasize 
feedbacks between the organisms and their 
environment. 

F.E. Clements The iconic scientists in Fig. 1 represent 
different points of view in the formative debates on the 
nature of ecological succession in the formative period 
in the first third of the twentieth century. Clements was 
in some sense the dominant figure in these discussions. 
A.G. Tansley who represented strong opposition to 
many of Clements’ fundamental concepts, wrote Clem¬ 
ents obituary [12] for The Journal of Ecology, 

► Though out of accord with many American ecologists, 
Clements had, throughout his career, a devoted follow¬ 
ing of younger men, and besides exercising a world¬ 
wide influence through his theory of vegetation, he 
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directly inspired a great deal of American ecological 
research. This was partly due to his powerful personal¬ 
ity. He was decidedly puritan, even ascetic (he neither 
drank nor smoked, and it gave him real pain to see 
other people doing so), and his manner was apt to be 
tinged with a certain arrogance. These things naturally 
antagonized many people; but at the same time his 
capacity for hard work, his intense and complete devo¬ 
tion to his subject, his powerful intellect and unremit¬ 
ting search for fresh knowledge and satisfactory 
formulations, could not fail to inspire the highest 
respect. 

Clements’ concept [3] saw secondary succession as 
being initiated by an event that removed all of the 
vegetation. Nowadays, this would fall under the term 
“disturbance” [13] . Clements referred to this process as 
nudation. Seeds, spores, etc., would then disperse to the 
site in the process of migration. These would germinate 
and become established in a collective set of processes 
that he called “eccesis.” The resultant plants would then 
interact with one another, notably through competi¬ 
tion. Clements called these collection of interactions, 
“coaction.” Successful plants would interact with the 
microenvironment at the site and, in a process called 
“reaction,” would modify the local site conditions as to 
make it unsuitable for the plant species that were there 
and more suitable for other species, which would 
replace them. This interaction is often called “facilita¬ 
tion” in modern ecology texts. The plant communities 
generated by this process are called “seres” The 
feedback between each sere and the environment pro¬ 
duces an ordered sequence of seres or serai stages of 
succession. This reaction-replacement process would 
continue until an assemblage of species, which were 
able to occupy the site after the plant-environment 
reaction process, became dominant (“stabilization”). 
This ultimate assemblage was the “climax community.” 

Clements’ climax community concept is incorpo¬ 
rated almost subliminally in a number of the ideas in 
plant ecology and landscape management, particularly 
in the Western United States. Maps of “potential 
vegetation” are often maps of climax vegetation in the 
style of Clements. This is reinforced by the two- 
dimensional nature of paper making it cumbersome to 
print maps with more than one potential vegetation in 
a given map location. Management of parks toward 


a “natural” condition often has a “climax community” 
concept as an underlying basis. Decisions to sustainably 
conserve land are straight-forward if there were a climax 
vegetation but should be difficult for successional seres, 
which might be expected to change over time. While 
Clements’ concepts are ubiquitous in land management, 
several ecologist have noted difficulties with the concept. 
More frequent criticisms of Clements’ concepts include: 

1. The idea that the sere/environment feedbacks pro¬ 
duce a necessary ordering of communities toward 
the climax community. Clements [11] noted, “The 
climax formation is the adult organism of which all 
the initial stages are but stages of development.” 
Some of the more ardent Clementsian ecologists 
[14-16] likened the sequence of seres in succession 
as a direct analog to the embryological development 
of an organism. The rejection of this concept of the 
community as a superorganism concept by Tansley 
[5] motivated the invention of the neologism, 
“ecosystem,” as a replacement for what he felt was 
a corrupted term, “community,” in 1935. 

2. The concept of facilitation. Facilitation has been 
attacked from evolutionary arguments to the effect 
that one would not expect a species to evolve to 
facilitate the success of a replacing competitive spe¬ 
cies. Field observations of species replacements 
support the existence of facilitation such as the 
presence of plants with nitrogen-fixing symbiotic 
organisms increase the nutrient status of a site and 
advantaging other species. There are also numerous 
counterexamples of species that hold sites 
tenaciously over generations before finally losing 
to competitors [6]. 

3. The progressive ecesis of different species relayed to 
the site for all the serai stages. In some successional 
sequences, all the species are present at the site from 
the first nudation event. In these cases, the progres¬ 
sion of serai stages is a consequence of the differ¬ 
ences in rate of maturation of herbaceous plants, 
shrubs, and trees [10]. 

4. The existence of a single stable climax community 
reached by a single sequence of seres. Several ques¬ 
tions have arisen about the climax concept. Is there 
a single climax community or is there always change 
in the vegetation at a site? [17] Can there be multi¬ 
ple stable climaxes? [18-20] Can there be several 
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pathways to a particular community? [21] Can the 
pathways fork? Are the seres comparable at different 
locations? 

These criticisms continue in no small part because 
of the importance of Clements and his students in the 
formulation of a grand theory of vegetation dynamics 
and their ability to translate their concepts into practi¬ 
cal applications. Subsequent recognition that past 
conditions could leave stable relict communities 
(e.g., preclimaxes such as grassland patches in temper¬ 
ate deciduous forest). Human or natural disturbances 
(such as occasional wildfires) could leave in a region 
a persistent vegetation that could be expected to change 
to a climax vegetation if the disturbance were con¬ 
trolled (“disclimax”). Unusual soil conditions or vari¬ 
ations in topography could produce persistent 
vegetation that was different from the regional stable 
vegetation (“preclimaxes and post climaxes”). These 
elaborations of the climax concept inspired a 
proliferation of conditional climax vegetations [3]. 
These extended definitions of the Clementsian succes¬ 
sion concept that essentially preserved the ideas of 
a single succession to a climax community through an 
ordered series of serai communities, also allowed for 


special conditions (disturbance regime, human use of 
the land, historical antecedents and soils) to create 
deviations from this basic model [22]. 

The debate over these issues and its continuation 
over almost a century in some cases, speaks to the 
complexity of the issues and the importance of under¬ 
standing succession in designing concepts to manage 
natural systems sustainably [11, 23]. The issues raised 
by the scientists in Fig. 1 are illustrated in Fig. 2 as 
deconstructions of the standard definition of ecological 
succession. 

H.A. Gleason and the Individualistic Concept of 
Succession Henry Gleason represents a logical foil 
to Clements holistic idea of progression of succession 
toward a climax vegetation. In total, his publications 
were strongly focused on plant taxonomy [24] but the 
relatively smaller section of ecological papers that he 
produced had remarkable impact on his colleagues, an 
impact that continues to the present day. As early as 
1910, Gleason [25] noted, “... it is impossible to state 
whether there is one definite climax vegetation in each 
province; it seems probable that there are several such 
association each characteristic of a limited proportion.” 


How much variability can be tolerated 
before the apparent order in the 
progression becomes recognized as 
dicordered or chaotic? 


Is the pathway of successional change ordered 
because the mechanisms that cause succession 
need to occur in a proper sequence or can the 
steps in the order be skipped in some cases? Is 
there more than one ordered progression? 


/ 


Ecological Succession is an ordered progression of 
structural and compositional changes in communities 
toward an eventual unchanging condition, 

the climax community. 


I 


Can this condition be reached 
before the dynamic climate 
changes? If so, is there more 
than one stable climax 
community? 


The term “ecosystem” was invented to 
replace “community” because some thought 
that viewing the community as a direct 
analogue to a living organism — with 
succession an equivalent to embryological 
development — was scientifically and 
conceptually unwarranted 


Ecological Succession and Community Dynamics. Figure 2 

Some of the basic issues in the nature of ecological succession arising from a standard definition 
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Gleasons view of succession differed significantly from 
that of Clements. He recognized as early as 1908 [26] 
that succession to be retrograde as well as progressive as 
posited by Clements. Succession was not necessarily an 
irreversible trend toward the climax community. 
Gleason also recognized that the climate could change 
in contrast with Clements’ theory of vegetation under¬ 
going succession in an unchanging climate. Gleason 
developed what he called the “individualist concept” 
that succession is the result of environmental require¬ 
ments of the individual species that comprise the 
vegetation. He also noted [27] that “... no two species 
make identical environmental demands.” 

Gleason’s concept of succession was a much more 
fluid and much less stereotypical concept of succession 
than that of Clements. Succession reflected the interac¬ 
tions of individuals with the environment. It could 
change in its nature with different climatic and other 
environmental conditions. It could progress or regress 
to a more different, stable community depending on 
time and circumstance. 

Henry Chandler Cowles One of the founders of eco¬ 
logical concepts of plant communities in the United 
States was the geologist turned ecologist, Henry C. 
Cowles, at the University of Chicago [11]. Cowles 
[28] studied the pattern of ecological succession in 
a set of sand dunes, the “Indiana Dunes,” which are 
now protected by the State of Indiana’s “Indiana Dunes 
State Park.” These dunes were formed sequentially 
from sediment that was exposed with variation in 
Lake Michigan. The oldest of the dunes, which is the 
furthest away from the Lake Michigan beach, has 
a loamy brown forest soil and is covered by a Beech- 
Maple ( Fagus-Acer ) forest; the youngest dune at the 
beach front has windblown sand as a soil and is covered 
with a patchy beach-grassland. Cowles recognized this 
entire land pattern represented a chronosequence, with 
the spatial location of the dunes of different ages 
representing different stages of successional develop¬ 
ment. He later [29] produced the now famous phrase 
that “succession was a variable chasing a variable” - the 
changes in vegetation chase the changes in the climate. 
This insight was, and remains, a remarkable concept of 
vegetation function. What he meant by this was that 
the rate of change of vegetation succession, the variable, 
is sufficiently slow that one could expect the climate, 


the chased variable, to change by the time the 
succession process was completed. 

Cowles was a remarkable and innovative scientist. 
He was honored in the 1935 issue of Ecology ; the journal 
of the Ecological Society of America. The Henry Chan¬ 
dler Cowles issue ( Ecology ; Volumel6, Number 3) is 
a collage of the central issues in plant ecology in its 
formative years: C.C. Adams, [30] an early researcher of 
ecological succession and the associated bird commu¬ 
nities, along with F.E. Clements, [31] wrote on human 
ecology and ecology in the public service; Transeau 
[32] discussed the “Prairie Peninsula,” a region of 
prairie vegetation jutting into a forests in the American 
Midwest, and emphasized that one must consider past 
climates as well as present climates to understand veg¬ 
etation; Fuller [33] and Sears [34] made much the same 
point looking at paleoecological data; several students 
and colleagues of Cowles presented data on mature 
vegetation in different areas. 

A.G. Tansley and the Ecosystem Concept The first 
of the papers in the Cowles issue and different in tone 
from the rest, was Tansley’s now classic 1935 paper on 
“The use and abuse of vegetational concepts and 
terms.” Tansley [5] discussed the ideas of the Ameri¬ 
can, F.E. Clements, and the amplification of these ideas 
by the South African, John Phillips. Tansley declared 
his strong disagreement with three of Phillips’ papers in 
the Journal of Ecology [14-16] characterizing ecological 
communities as “quasi-organisms” whose succes¬ 
sional dynamics were analogous to embryological 
development, “The community is born, grows, 
matures, reproduces, and carries out various other 
biotic phenomena: it behaves in a manner similar to 
an individual, with obvious and natural differences 
inherent in its wholly divergent and far more complex 
structure, constitution and functions. It behaves in 
such a way as to justify the view that not only is it 
similar to an organism, but that it is a kind of organ¬ 
ism ...” This conceptualization is in keeping with the 
principle of holism from Aristotle’s Metaphysica 
onward and often abbreviated as “the whole is greater 
than the sum of its parts.” Phillips attributed many of 
his ideas to Jan Smuts’ book, Holism and Evolution [35] 
and noted later in writing a eulogy for Clements, [36] 
that his colleague Smuts was profoundly influenced by 
the writings of Clements. 
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Tansley created the ecosystem concept as in contrast 
to Phillips’ and Clements’ views. It is significant that in 
his iconoclastic “ecosystem” definition, he emphasized 
ecosystems “were of the most various kinds and sizes.” 
In doing so, Tansley basically defined an ecosystem as 
what systems scientists would nowadays call a system of 
definition : an arbitrary system defined by the specific 
considerations for a particular application [37]. 
Tansley’s ecosystem definition conforms well to more 
mathematical, interactive-system concepts in other 
sciences. His definition includes the intrinsic 


consideration of scale found in other sciences, partic¬ 
ularly physically based sciences. 

Tanssley also endorsed the concept of the 
“polyclimax”, the possibility of multiple stable ecosys¬ 
tems following from succession and a logical contrast 
to the Clements idea of the “monoclimax” as the even¬ 
tual product of successional processes working in 
a region. An example of the polyclimax is shown 
in Fig. 3 for succession after fires in the boreal forests 
in the vicinity of Fairbanks, Alaska. In this case, 
the pattern of succession following a wildfire on 


A disturbance opens up a relatively large space 



Onlly certain “pioneer” species 
are capable of becoming 
established in the open space 


Individuals of any species in the succession could establish and exist as 
adults under the prevailing conditions 


* i 


Modification of the 
environment by the 
early occupants makes 
it more suitable for 
the recruitment of 
“late-successional” 
species 
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Tolerance Model 
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The sequence continues until 
the current resident species 
no longer facilitate the 
invasion and growth of 
others species or until no 
species exists that can invade 
and grow in the present of 
the resident 


i 


If external stresses are 
present early colonists 
may be damaged (or 
killed) and replaced by 
species which are 
more resistant 


Ecological Succession and Community Dynamics. Figure 3 

Three different models of ecological succession from J.H. Connell and R.O. Slatyer [19] depicting succession as arising from 
the resident species at a location encouraging the colonization of a subsequent set of species (Facilitation Model), by the 
resident species being relatively neutral to the other species (Tolerance Model), or by the resident species inhibiting 
the colonization by other species (Inhibition Model) 
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south-facing slopes eventually leads to a White Spruce 
(Picea glauca) forest that is quite different in its com¬ 
position and character from the Black Spruce (Picea 
marina) forest seen on north-facing slopes. To some 
degree, Clements and his colleagues used terms such as 
disclimax, etc. (see above) to handle these sorts of cases 
in their monoclimax theory of succession. 

A.S. Watt One of Tansley’s outstanding students was 
Alexander Stuart Watt. He developed his doctorate in 
1924 on the dynamics of Beech (Fagus sp.) forests [38]. 
He noted the structure of the beech forest was at the 
scale of the canopy controlled by a dominant large tree 
best characterized as a cyclical process of the regenera¬ 
tion in the patch of well-lighted area that followed the 
death of a canopy tree, the growth of these recruits and 
their competition with one, and the development of 
a dominate canopy tree in that location. The death 
of this large tree would then reinitiate the cycle. By 
1947, Watt had become the president of the British 
Ecological Society and delivered one of the most 
important papers in ecology as his presidential address. 
This presidential address and the subsequent paper 
represented a synthesis of what nowadays might be 
called the “big picture” science view of his doctoral 
advisor, A.G. Tansley, with Watt’s own insights based 
of his field experience and broad ecological interests. In 
his address, Watt represented the plant community as 
a “working mechanism” of interacting plant processes 
such as regeneration, growth, and death on individual 
plants producing the broader-scale pattern of plant 
communities. He stated, “The ultimate parts of the 
community are the individual plants, but a description 
of it in terms of the characters of these units and 
their spatial relations to each other is impractical at 
the individual level. It is, however feasible in terms of 
the aggregates of individuals and species which 
form different kinds of patches: these patches form 
a mosaic and together constitute the community. Rec¬ 
ognition of the patch is fundamental to an understand¬ 
ing of the structure as analyzed here” [39]. To illustrate 
this communality in the underlying working dynamics 
of vegetation, he compared a diverse array of plant 
communities - heathlands, grasslands, bogs, montane 
vegetation and forests. 

As was the case with Gleason, Watt noted that one 
must account for the results of individual plants 


interacting with one another, but he also felt such an 
accounting was “impractical at the individual level.” 
Twenty years later, the explosive expansion of 
computational power that still continues to date has 
made such computations more and more feasible. With 
the development of increasingly powerful digital com¬ 
puters, starting in the 1960s and continuing to the 
present, several different scientific disciplines (physics, 
astronomy, ecology) independently began to apply 
computers to the tasks of “book-keeping” the changes 
and interactions of individual entities. Forest “gap” 
models (which will be discussed below) were one of 
this class of individual-based models or IBM’s that 
brought Watt’s insights on vegetation dynamics into 
a form that could be projected using computer 
simulation techniques. 

Attempts at Synthesis: Connell and Slatyer The per¬ 
sistent dichotomy represented by Clementsian versus 
Gleasonian views of succession has consistently invited 
attempts to synthesis toward a unified concept. 
J.H. Connell and R.O. Slatyer [19] developed a 
synthesis that emphasized the individual attributes of 
plants and their feedback with their environment. 
Clementsian succession was a special case, which they 
called the “Facilitation Model” (Fig. 4). The replace¬ 
ment patterns in the Connell and Slatyer model were 
for sets of species, which also implies to some degree 
a replacement of one community with another, which 
also is more of a Clementsian concept. Succession was 
driven by transfers from one set of species as the facil¬ 
itation model in which succession as arises from the 
resident species at a location encouraging the coloni¬ 
zation of a subsequent set of species, as the tolerance 
model in which the resident set of species are relatively 
neutral in their interaction with other species, or as the 
inhibition model in which the resident species block 
the colonization by other species (Fig. 4). 

Attempts at Synthesis: E.P. Odum Eugene Pleasants 
Odum was a significant figure in what Robert McIntosh 
called the rise of ecosystems science. In 1953, he wrote 
the first modern ecology textbook, [40] which was the 
standard text for an emerging formulation of ecology as 
a formal academic discipline. He and his brother 
H.T. Odum were instrumental in the use of energy 
transfers in ecosystems (ecological energetics) as 
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Ecological Succession and Community Dynamics. Figure 4 

Ecological succession following wildfires in the vicinity of Fairbanks, Alaska (USA). Left. Ecological succession of a south 
facing slope. Right. Ecological succession on a north- facing slope [78] 


a common currency for comparing ecosystems with 
one another. As the sons of a crusading sociologist, 
H.W. Odum, the two Odum brothers were drawn to 
include “human ecosystems” in their studies, particu¬ 
larly the energetics of human society and its interac¬ 
tions with the natural ecosystems. The impact of 
ecological energetics on ecology was profound in that 
it promoted large-system intercomparisons. The Inter¬ 
national Biological Programme, the first “big picture” 
synoptic study of ecosystems used ecological energetics 
as an organizing principle for projects that compared 
forests, grasslands, and other biomes, worldwide. These 
comparisons were often made using “compartment 
models,” diagrams in which the transfers of energy in 
units of such as Kcalm 2 year -1 were indicated by arrows 
connecting boxes which expressed the integral of the 
arrows such as Kcalm 2 . Compartment models were 
easily transformed into ordinary differential equations 
for model projections 

Also in the 1950s and 1960s, the United States 
Atomic Energy Commission (USAEC) sponsored 
a large program studying the movement of radioactive 
isotopes in natural environments including whole eco¬ 
system experiments and direct measurements of rates 
of transfer of isotopes through natural environments 
[41]. Large, linear compartment models developed in 
pharmacology [42], and used in ecological energetics 


were developed to predict the ecological transfer of 
these materials. There were several advantages to such 
linear models. They (even for complex linear models) 
can be solved at equilibrium by algebraic manipula¬ 
tions, an advantage given the modest computational 
power available in the 1950s and 1960s. When the 
transfer of material between compartments is a fixed 
percentage of the amount of material in a source 
compartment (called donor-controlled flow), linear 
models have the property that, with a constant input 
of material into the system, the equilibrium levels of 
materials are also the maximum amount. Importantly 
for models of radioisotope transport, radioactive decay 
of the isotope is a linear, donor-controlled process. 
Models of this formulation continue today in applica¬ 
tions to determine the transfer of radioactive material 
to human populations from reactors or nuclear 
accidents such as the Chernobyl event. 

In 1969, E.P. Odum wrote “The Strategy of Ecosys¬ 
tem Development,” [43] a synthetic paper strongly 
based on ecological energetics and compartment 
models in ecosystems. The emphasis of the paper 
was strongly in the area that nowadays might be 
termed as sustainability science as is evidences by its 
first lines, “The principles of ecological succession 
bear importantly on the relationship between man 
and nature. The framework of successional theory 
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needs to be examined as a basis for resolving man’s 
current environmental crisis.” Odum’s view of succes¬ 
sion was summarized as a tabular model of ecological 
succession (Table 1). It was strongly Clementsian with 


Ecological Succession and Community Dynamics. 
Table 1 A "tabular model" of changes expected over 
ecological succession from E.P. Odum 


Ecosystem attributes 

Early 

succession 

Late 

succession 

GPP/respiration 

> or <1 

Approaches 

1 

GPP/biomass 

High 

Low 

Biomoss/energy 

Low 

High 

Net community production 
(NPP) 

High 

Low 

Food chains 

Linear 

Web-like 

Total organic matter 

Small 

Large 

Inorganic Nutrients 

Extrabiotic 

Intrabiotic 

Species richness 

Low 

High 

Species evenness 

Low 

High 

Biochemical diversity 

Low 

High 

Stratification and pattern 

Poorly 

organized 

Well 

organized 

Niche specialization 

Broad 

Narrow 

Size 

Small 

Large 

Life cycles 

Short, simple 

Long, 

complex 

Mineral cycles 

Open 

Closed 

Nutrient exchange 

Rapid 

Slow 

Role of detritus 

Unimportant 

Important 

Selection on growth form 

r-Selection 

k-Selection 

Selection on production 

Quantity 

Quality 

Symbiosis 

Undeveloped 

Developed 

Nutrient conservation 

Poor 

Good 

Stability 

Low 

High 

Entropy 

High 

Low 

Information content 

Low 

High 


GPP gross primary production (photosynthesis), NPP net primary 
production (GPP minus respiration) 

Source: From Odum [40] 


some of the superorganism emphasis of Phillips added 
in. It also included a number of ideas that were pop¬ 
ular in theoretical and population ecology at the time, 
such as the idea that biotic diversity in an ecosystem 
connotes ecosystem stability or the concept that suc¬ 
cession was inherently driven to maximize some fea¬ 
tures (total biomass, biotic diversity, ratios of energy, 
or material transfers, etc.). These latter cons¬ 
iderations can be seen as a logical continuation of 
Clements’ concept that succession was progressive in 
its direction. 

Odum’s 1969 paper generated and still generates 
strong interest in the scientific community and con¬ 
tinues to be influential these 40 years later. (At this 
writing in the end of year 2010, the Odum paper has 
been cited over 1,700 times with no diminution in 
citation rate.) As one might expect given its essential 
Clementsian theme updated with central topics in eco¬ 
systems ecology, the Odum’s paper attracted several 
negative reviews from the ecologists of a Gleasonian 
persuasion, particularly given the ascendency of ideas 
association with Gleason’s papers on succession [6]. 
Nevertheless, Odum’s work has had a strong influence 
on the direction of sustainability science. Its rules on 
how change should progress toward appropriate natu¬ 
ral goals provide guidelines to how ecological systems 
might be managed toward naturally appropriate goals. 
Its dogmatism makes it a target for criticism but also 
gives it an influential position in those wrestling with 
the difficult problem in a dynamical, strongly human- 
altered planet. 


E 


Sustainability and Vegetation Dynamics 

In the historical review above, one finds succession to 
be a complex concept with different interpretations as 
to its causes and attributes over time. Ecologists such 
as F.E. Clements and E.P. Odum viewed succession as 
a process that drove change in vegetation everywhere. 
The generality of their application invited the inspec¬ 
tion of local examples and the discovery of “exceptions 
to the rule” that could whittle away at the claimed 
generality. They also were drawn to applying these 
principles in understanding human changes on land¬ 
scapes. Other ecologists such as H.A. Gleason and 
probably the majority of modern ecologists active in 
this area of research today developed a much more 
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diffuse notion of succession. How do these ideas con¬ 
tribute to sustainability sciences and how can they be 
applied? 

Clementsian Succession and Sustainability Science 

The importance of succession for sustainability science 
can be illustrated by two examples: The first is from the 
historical role of succession theory in land-use policy in 
the United States; the second involves the incorpora¬ 
tion of succession theory into environmental policy. 
Both examples favor Clementsian successional 
approaches in application in sustainability science. 
A following section will discuss the application of 
more Gleasonian approaches to sustainability science. 

Succession and the United States Land-use Policy 

The importance of succession theory in the land-use 
policy of the United States is can be illustrated in 
human connections associated with F.E. Clements. 
Clements’ doctoral advisor at the University of 
Nebraska was Charles Edwin Bessey, a leading botanist 
of his era. Bessey came to the University of Nebraska in 
1884. He was a leading college administrator; he wrote 
high school and college textbooks; he served as editor 
for the journal Science. He produced a remarkable set of 
students: the Pulitzer Prize winning novelist, Willa 
Cather; Frederic Clements and his wife, Edith; and 
Roscoe Pound, the leading jurist of his time. Pound 
and Clements wrote several papers together. 
Pound went on to the faculty at Harvard University, 
became Dean of the Harvard Law School, and eventu¬ 
ally served as an advisor to President Franklin Roose¬ 
velt involved in the structuring of the “New Deal” 
reformulation of the structure of American 
government. Among the “New Deal” legislation came 
administrative structures and regulations for land-use 
policies, conservation and land protection, soil conser¬ 
vation, and other topics which nowadays would clearly 
fall in the rubric of sustainability. Pound eventually 
wrote an obituary/recollection for F.E. Clements [44]. 

The connections among these remarkable individ¬ 
uals from American heartlands are indicators of 
a deeper relationship between the ecology of Clements, 
his associates and students, and land-use policies in the 
United States. The Great Depression of the 1930s in 
America was acerbated by a collapse of agriculture in 
the “Dust Bowl” states through the middle of the 


nation. A complex combination of persistent land 
abuse, a climatic anomaly in the form of a drought, 
and bank collapse accompanying the collapse of family 
agriculture turned dryland farmers into unemployed 
refugees. The universities of the Midwest and the 
emergence of focused studies, such as the “Botanical 
Seminar” of Bessey at Nebraska (which had collected 
the Clements, Pound, and other students), had been in 
part a larger problem of understanding the difficulties 
in agriculture in the United States as farming moved 
westward from the forest of the East to the central 
grasslands [45]. Range management as an agricultural 
science grew from this tradition and particularly from 
the work of Clements and his colleagues [46, 47] . Thus, 
the emerging science of range management developing 
at the universities at the center of an ecological disaster 
involving the dynamics of grassland ecosystems pro¬ 
duced information and applications on what could and 
should be done to restore stability. Political and policy 
advisors, such as Roscoe Pound, were in a position to 
translate this science and to put into laws concerning 
the sustainable use of grasslands, rangelands, and other 
ecosystems - in American and to a degree, worldwide. 

Clarity of Clementsian Succession and the Formula¬ 
tion of Policy Environmental policy like most polit¬ 
ical processes eschews complexity. One of the features 
of modern is the need for “elevator speeches” - sum¬ 
maries of complex issues that can be delivered to 
a listening decision-maker in the length of time one 
might have riding an elevator to a meeting. These and 
the even shorter snippets of information present to the 
press convey the idea that good ideas necessarily can be 
expressed succinctly. Certainly the Principle of Parsi¬ 
mony or “Ockhams Razor” in the sciences, the idea that 
one should select the hypothesis that makes the fewest 
additional assumptions when hypotheses are otherwise 
equal and a founding concept in modern science, lends 
itself in some interpreters to this point of view. Readers 
of F.E. Clements works are almost always impressed by 
the formidable richness of details in his writings, which 
he produced to support relatively clear general state¬ 
ments as to how succession as a unified process works. 
E.P. Odum, in some sense an inheritor and elaborator 
of Clements concepts, wrote using rich analogies to 
convey simple principles that operate in complex 
systems. 
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Clements’ and Odum’s writings share the ability to 
convey complexity with simple principles has served to 
make succession a tangible concept for policy makers 
and to illustrate the application of these principles. 
These “ecological generalizations as principles” have 
been widely used to impress upon the general public 
an appreciation of a unified ecological view of succes¬ 
sion. The seemingly less-focused, special-case orienta¬ 
tion of Gleason and subsequent ecologists seems 
complicated by exceptions. Range management 
research, which today still has a Clementsian theme 
and which has been successful in producing widely 
applied and useful policies for sustainable land man¬ 
agement, is a tangible example that this approach can 
work. It is being challenged with its application in 
developing nations and in areas with old soils 
and variable climates but it remains the central 
paradigm [48]. 

Even though most modern ecologist would see 
Gleasonian views of succession as most appropriate, 
the findings do not always lend themselves to the 
“elevator-speech” test. One might ask, “What does an 
application of Gleasonian succession theory in sustain¬ 
ability look like in its application?” The answer is in 
modern forest and the very root of the origin of the 
word sustainability. This is the topic of the following 
section. 

Nachhaltigkeit, Sustainability and Modern Forestry 

Throughout Europe in Medieval times there was 
a substantial clearing of forests and extensive regional 
deforestation [49]. As forests, conflicts over the use 
products of forests intensified. This eventually pro¬ 
duced a class conflict and subsequently laws against 
poaching of animals, thieving of wood, and proscrip¬ 
tions against public use of forests, in general. In the 
mid-eighteenth century, a forest management concept 
called “Nachhaltigkeit” (translated into English as 
the neologism, “sustainability”) was developed by 
the Germans. From about 1,800, this new forestry 
practice spread over Europe, particularly Northern 
Europe. Nachhaltigkeit involved detailed determina¬ 
tion of how to best manage forests to produce wood 
and other goods in perpetuity. In German, the root 
Nachhaltigkeit alone means nothing more than having 
a lasting or sustained effect. 


Nachhaltigkeit involved detailed determination of 
how to best manage forests to produce wood and other 
goods. Essential to this objective was manipulating den¬ 
sity by spacing trees on a given site, either by planting 
trees or by thinning a naturally regenerated stand of trees 
following a timber harvest or a natural disturbance. 
Additionally, one needed to determine how long one 
should wait before harvesting a stand of trees and then 
planting a new stand. This spacing/length-of-rotation 
problem had long been solved for crop plants through 
experimentation and observation. To produce trees as 
long-lived crops, elaborate long-term data collection 
started on the height; size in diameter; amount of 
wood; size of crowns in forest stands of different densi¬ 
ties at sites with different environmental conditions. 
Eventually, a forest modeling concept called the “yield 
table” approach developed and became the signature of 
modern forestry. Nowadays, some of the historical forest 
data-sets used in yield tables have grown to 200+ years of 
continual record [50]. 

On a given kind of site (same soils, same rainfall, 
etc.), trees growing in even-aged, single-species stands 
(such as forest plantations) tend to grow to the same 
height at a given age, regardless of density of trees 
[51, 52]. At low densities, trees have with large diame¬ 
ters and crowns and on an equivalent high-density 
location, the trees would have small diameters and 
crowns - but the heights of the trees would be the 
same in both cases. In a yield table, decades and some¬ 
times centuries of forest stand data are arranged by the 
height of the trees at a given location reach at a given 
age, usually the typical age of tree harvest. The tree 
height at this standard age is called a “site index” and 
is used to signify the overall quality of a location for 
growing trees. A site index is the canopy height one 
would expect a forest to attain in a reference length of 
time. If, for example, the dominant trees in the canopy 
of a forest attain a height of 30 m in 50 years 
(the reference time) then the forest would have a site 
index of 30. Sites with larger site indices would reach 
the 30 m height earlier than 50 years; sites with lower 
site indices would reach 30 m later (if at all). Using yield 
tables, one can assess the volume of wood in managed 
stands of equivalent heights on different sites. 

Site index is clearly defined in terms of the basic data 
that goes into a yield table and it can be directly deter¬ 
mined by measuring the heights of trees on even-aged 
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stands at the reference age. Associating site index with 
actual plots of land is a learned skill and an art at the same 
time. A capable site surveyor can judge site index by 
reconnaissance of land in a particular region and can 
make a good wage practicing this trade. Along with 
such arcane practices as ax-throwing, log-rolling, and 
tree-felling, forestry schools have regular intercollegiate 
competitions of judging site indices among their students 
as part of “Forestry Field Day” celebrations. At the edge of 
most universities with a college of forestry, one finds plots 
of trees planted at different densities for field teaching on 
the calibration of yield tables. The yield table concept is 
the quantitative basis of modern forestry. 

One significant observation found in yield tables is 
that the rate of mortality of trees in stands undergoing 
natural thinning is highest in stands with the highest 
site indices. Faster growing trees suppress their 
competitors at a higher rate than slower growing trees 
on low site-index locations. The trade-off between 
rapid tree growth versus high mortality on good sites 
is the basis of the development of thinning strategies 
for managed stands and for schemes to plant forest 
plantations at particular spacing of trees. Despite their 
elevated mortality, sites with high site indices are more 
productive than low site-index locations. 

Nachhaltigkeit was fueled by social unrest. It 
resulted in laws and regulation based on scientific 
research and experimentation with inspiration stem¬ 
ming from a shared political necessity to better manage 
the forests of Europe. Nachhaltigkeit was also the origin 
of the initial use in English of the word, sustainability. 
Its development into modern forestry practice was 
an ongoing process that continues today. Modern for¬ 
estry is significant in sustainability science as a first 
real-world application. Modern forestry and its 
Nachhaltigkeit origins also represent a model of 
Gleasonian approaches to forest dynamics into sustain¬ 
ability science. It focuses on individual organisms and 
their relationships with one another and with their 
environment. One of the significant implications is 
that quantitative ecology and in particular, ecological 
models, likely could have a significant role in sustain¬ 
ability science. Individual-based vegetation dynamics 
models have a logical intellectual inheritance from 
forestry studies and are an example of tools for the 
application of Gleasonian successional concepts. 
These are the topics of the next section. 


Individual-Based Models: Computer Descendants 
of Gleasonian Succession 

Currently, several factors are simultaneously arising 
that challenge the application of successional theory 
in sustainability science. Climate conditions are chang¬ 
ing, perhaps with a greater rate of change than seen in 
the past [53]. The carbon dioxide composition of the 
atmosphere has been altered by human activities with 
potential effects on plant processes particularly photo¬ 
synthesis and the efficiencies of plant water-use [54]. 
This issue was addressed several decades ago by 
foresters who realized that the elaborate calibrations 
used to develop long-term data on forest change were 
subject to less accurate predictions if selected genetic 
strains of trees were used, if forests were fertilized or if 
there were significant variations in climate. 

Impressed by the power of computers and inter¬ 
ested in developing a methodology to use highly 
detailed computer models to simulate the spatial rela¬ 
tions among thinning of trees, tree growth, and spatial 
arrangement of trees, foresters developed individual- 
based dynamic models of forests (mostly for 
commercial forests) in the mid-1960s. Their computa¬ 
tional innovation was being independently paralleled 
in other fields, notably astronomy, physics, and several 
engineering sciences [55, 56]. The early individual- 
tree-based forest models were quite complex. For 
example, the competition among individual trees was 
typically simulated by crown interactions involving the 
3-dimensional geometry of each individual tree crown 
for all the trees in a stand; [57] the growth of the tree 
trunks of each tree were often simulated at multiple 
heights; [58, 59] a tree’s mortality was related to 2- or 
3-dimensional crown pruning among trees [60]. 

For vegetation dynamics, the earliest individual- 
based model of forests [57] was developed for Doug- 
las-fir (Pseudotsuga menziesii ) applications and there 
were soon several similar models developed as doctoral 
dissertations at several Schools of Forestry [37]. These 
early individual-based models of forests were inordi¬ 
nately complex, perhaps because they were applied to 
tree-spacing problems in even-aged and single-species 
forests or forest plantations. However, a most detailed 
early model [61] with multiple tree species, tree inter¬ 
actions based on interacting three-dimensional crown 
geometry, mapping exact locations of trees, calculating 
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the dispersion of seed-fall based on seed morphology, 
etc., exceeds the complexity of almost all of the 
individual-based models of today. 

As noted above, Watt noted that to understand 
succession one must account for the results of individ¬ 
ual plants interacting with one another, but that it is 
“impractical at the individual level.” Twenty years later, 
the explosive expansion of computational power that 
still continues to date has made such computations 
more and more feasible. Early versions of these models 
in ecology were developed by population ecologists 
interested in including animal behavior in population 
models [62-64] and lead to a diverse array of applica¬ 
tions for fish, insects, and birds [55]. An advantage of 
such models is that two implicit assumptions associ¬ 
ated with traditional ecological modeling populations 
are not necessary, namely that: 

1. The unique features of individuals (including their 
size and relative location) are sufficiently 
unimportant to the degree that individuals are 
assumed to be identical and 

2. The population is “perfectly mixed” so that there 
are no local spatial interactions of any important 
magnitude [55]. 


into the realm of “industrial secrets” as their developers 
found employment in the timber industry. 

In 1980, a paper reviewing some of this work [66] 
coined the term, “gap model,” to describe this class of 
models. The gap-model designation was originally 
developed to emphasize that a principal simplifying 
assumption in these models (the assumption that the 
competition among individual trees on a small patch of 
land was homogeneous in the horizontal over a small 
area of land but spatially explicit in the vertical dimen¬ 
sion) fitted well with the classic “gap dynamics” 
concept of A.S. Watt [39, 67]. At the time the term 
gap model was coined, Watt’s concept had been signif¬ 
icantly reintroduced to American ecologists by Frank 
Bormann and Gene Likens [68, 69]. A generation of 
forest ecologists have made numerous extensions of 
“gap models” and the term, nowadays, refers to 
a broad class of individual-based models of forests 
and other ecosystems of a natural character (mixed- 
age, mixed-species, natural disturbance regimes, etc.). 

An Application of an Individual-Based Model on 
Sustainability of Russian Forests under Climate 
Change 


E 


As Watt noted, most ecologists are interested in 
variation between individuals (a basis for the theory 
of evolution and a frequently measured aspect of plants 
and animals) and appreciate spatial variation as being 
quite important. Assumption that this variation some¬ 
how is uniform seems particularly inappropriate for 
trees which are sessile and which vary greatly in size 
over their life span. This may be one of the reasons why 
tree-based forest models are among the earliest and 
most widely elaborated of this genre of models in 
ecology. 

In 1972, Botkin and colleagues [65] produced an 
important simplification of the early forestry work 
(called the JABOWA model for the initials of its 
developers) and introduced this modeling approach 
to ecologists. They did not cite any of the antecedent 
forestry models and likely were not aware of them. The 
earlier forestry work remained sub rosa to most ecolo¬ 
gists. The forestry models were mostly, but certainly 
not entirely, in sources not frequently read by forest 
ecologists. Additionally, some of the forestry models 
were seen as sufficiently useful that the models drifted 


Global climate model simulations indicate that the 
Northern Hemisphere’s boreal forests and, in particu¬ 
lar, the Siberian boreal forest zone, may not only 
respond to climate change but may affect the Earth’s 
climate through feedbacks involving changes in the 
regional surface albedo, the degree to which the surface 
reflects incoming radiation. Bonan and his colleagues 
[70] altered surface albedo in order to simulate the 
clearing of the boreal forest in the National Center for 
Atmospheric Research (NCAR)’s Community Climate 
System Model version 1.0 (CCSM1). This substantially 
cooled the Earth not only in the boreal zone but across 
the Northern Hemisphere. Betts [71] used the Hadley 
Center Atmosphere Model (HadAM3) to simulate the 
climatic consequences of albedo changes from growing 
more trees, worldwide. He found that the surface 
albedo changes associated with the growth of conifer¬ 
ous evergreen trees in boreal regions led to significant 
increases in the average global temperature. These 
increases were large enough to overshadow the effect 
of the carbon storage that occurred as a result of 
growing evergreen forest in that region. 
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Field observations provide further evidence that 
changes in the boreal forest may impact the global 
climate. Larch forest, dominated by both Larix sibirica 
and L. gmelinii , covers extensive regions in Siberia. 
Shifts from larch to dark-conifer forests, dominated 
by trees such as spruce or fir that are tolerant of higher 
temperatures, have been documented [72, 73]. Because 
larch is a deciduous conifer, this shift in forest com¬ 
position would lead to the similar albedo changes as 
the evergreen tree growth simulation presented by 
Betts. This reduction of albedo associated with 
a documented shift in forest type (larch to dark coni¬ 
fer) indicates that warming temperatures may lead to 
a positive feedback response: a warmer climate accel¬ 
erates the natural succession from larch to dark- 
conifer forest; the resultant albedo change promotes 
additional warming. 

Dynamic vegetation models, specifically forest gap 
models, are ideally suited to an exploration of the 
impacts that climate change may have on the structure 
and composition of boreal forests and the existence of 
a climate/land cover feedback in this region. The 
FAREAST [74] model was run at a total of 2,083 sites 
across the former USSR. FAREAST uses monthly 
climate parameters derived from historical station 
data to compute daily temperature and update soil 
water [75]. In particular, at each site, the model’s 
climate inputs are drawn from a statistical distribution 
of monthly values for minimum and maximum mean 
temperature and precipitation which is derived from 
60 years of data recorded at local weather stations [76] . 
The model also uses values for soil field capacity and 
soil carbon and nitrogen [77]. 

The birth, growth, and eventual death of individual 
trees are determined in response to local site parame¬ 
ters such as soil moisture and nutrient availability, 
which are updated annually with bio-environmental 
conditions, soil moisture, and available nutrients. 
Individual trees compete for light and nutrients with 
stochastic processes governing the birth and death of 
trees in a circular twelfth hectare plot, which 
approximates the size of a mature tree crown. Forty- 
four individual tree species are included in FAREAST 
simulations. These species represent the genera which 
dominate Northern Eurasian forests. Each species is 
characterized by 25 parameters which describe the spe¬ 
cies’ fundamental silvics and determine which species 


has an advantage in terms of competition for light or 
nutrients, or tolerance to lack of water. At each of the 
2,083 sites, 200 independent twelfth hectare plots were 
simulated and then the modeled biomass values were 
averaged for each species in each year of the model run. 

The overall response of Russian boreal forests to 
climate change when the effects of changes in temper¬ 
ature and precipitation are separated show higher aver¬ 
age precipitation leads to increased biomass (Fig. 5a), 
lower average precipitation results in decreased bio¬ 
mass (Fig. 5b), and warming causes decreases in 
biomass for certain regions, though in parts of Siberia, 
where average temperatures are extremely low, 
warming induces an increase in forest biomass 
(Fig. 5c). There are also genera specific patterns in the 
shifts in biomass that occur across Russia. Specifically, 
there are different patterns of change for Larix spp. and 
Pinus spp. in response to temperature warming. Both 
genera display a decrease in biomass in western and 
southwestern Russia and the Russian Far East. The 
number of sites that experience a biomass decrease for 
Larix spp. is larger than the number of sites that show 
a decline in Firms spp. In particular, the sites that show 
a decline in Larix spp. extend further northward in 
both European Russia and the Russian Far East. 
A more detailed analysis is required to determine 
whether these patterns are the result of a replacement 
of Larix spp. with Firms spp. 

Future Directions 

Ecological succession has a complex and controversy 
rich history. It is significant over this history that the 
importance of successional knowledge as a prerequisite 
to the wise management of landscapes has not been 
a point of issue. The debates in succession are not about 
its importance. Rather, the debates were and are over 
how succession works, how it should be understood, 
and how it can be wisely applied. This is all the more so 
in the case of sustainability science as a user of succes¬ 
sional theory and succession concepts. To sustainably 
manage the planet, it is an absolute prerequisite to have 
a capability to predict the dynamic change of the 
world’s vegetation. To the extent that the understand¬ 
ing of succession and vegetation dynamics is not com¬ 
plete, so too will be the state of the understanding of 
sustainable management. 
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Ecological Succession and Community Dynamics. Figure 5 

Decreasing biomass is shown in pink and increasing biomass is shown in green. Figure shows difference in magnitude of 
total forest biomass (tCha -1 ) for year 200 between a baseline scenario with no change in climate, and: (a) climate scenario 
with a +10% precipitation showing an overall pattern toward increasing biomass; (b) climate scenario of —10% 
precipitation with pattern is toward decreasing biomass; (c) climate scenario with a 2°C increase with decreasing biomass 
in western and southwestern locations, but an inconsistent response across the region 


Ecological models of vegetation dynamics have 
developed to a great degree over the past several 
decades. The capability to predict change in vegetation 
is much more today as it was 50 years ago. At the same 
time, the novel and large environmental changes 
that humankind is visiting upon the biosphere 


(changes in the gasses of the atmosphere, in the diver¬ 
sity of regions, in the local to global climate, etc.) 
conspire to challenge models to predict outside the 
range of conditions in which they were developed. 
Extrapolation is always a difficult endeavor for life 
scientists. 







3294 


E 


Ecological Succession and Community Dynamics 


One would expect sustainability scientists to 
become more dependent on models when conditions 
are novel. This novelty conspires to make models 
potentially less reliable. Sustainability has a word origin 
in the word, Nachhaltigkeit. That legacy grew into 
modern forestry with a quantitative capability to 
predict the future changes of forests under active man¬ 
agement. In some senses, however, forestry with its 
emphasis on single-species, even-aged forest stands 
has developed the simple case and provided an example 
of how much effort it takes to develop such the 
simple case. The task of understanding the complex 
cases of natural ecosystems in the face of novel condi¬ 
tions is both daunting and essential. The real challenges 
to sustainability science is the development of mean¬ 
ingful policies and protocols when the essentials, such 
as succession theory, are themselves changing with new 
discovery. 

Bibliography 

Primary Literature 

1. Clements FE (1916) Plant succession: an analysis of the devel¬ 
opment of vegetation. Carnegie Institute, Washington, DC, 
Publication no 242 

2. Clements FE (1928) Plant succession and indicators. Wilson, 
New York 

3. Clements FE (1936) Nature and structure of the climax. J Ecol 
24:252-284 

4. McCune B, Grace JB, Urban DL (2002) Analysis of ecological 
communities. MjM Software Design, Gledenon, 304 pp 

5. Tansley AG (1935) The use and abuse of vegetational concepts 
and terms. Ecology 16:284-307 

6. Drury WH, Nisbet ICT (1973) Succession. J Arnold Arbor 
54:331-368 

7. Gleason HA (1926) The individualistic concept of the plant 
association. Bull Torrey Bot Club 53:1-20 

8. Gleason HA (1939) The individualistic concept of the plant 
association. Am Midi Nat 21:92-110 

9. Rich PH (1988) The origin of ecosystems by means of subjec¬ 
tive selection. In: Pomeroy LR, Alberts JJ (eds) Concepts of 
ecosystem ecology. Springer, New York, pp 19-28 

10. Egler FE (1954) Vegetation science concepts. I. Initial floristic 
composition - a factor in old-field vegetation development. 
Vegetatio 4:412-417 

11. McIntosh RP (1985) The background of ecology: concept and 
theory. Cambridge University Press, Cambridge, 383 pp 

12. Tansley AG (1947) Obituary notice. Frederic Edward Clements, 
1874-1945. J Ecol 34:194-196 


13. White PS (1979) Pattern, process and natural disturbance in 
vegetation. Bot Rev 45:229-299 

14. Phillips J (1934) Succession, development, the climax and the 
complex organism: an analysis of concepts. I. J Ecol 
22:554-571 

15. Phillips J (1935) Succession, development, the climax and the 
complex organism: an analysis of concepts. II. J Ecol 
23:210-246 

16. Phillips J (1935) Succession, development, the climax and the 
complex organism: an analysis of concepts. III. J Ecol 
23:488-508 

17. Heinselman ML (1981) Fire and succession in the conifer for¬ 
ests of northern North America. In: West DC, Shugart HH, 
West DC (eds) Forest succession: concepts and application. 
Springer, New York, pp 374-405 

18. Cattelino PJ, Noble IR, Slatyer RO, Kessell SR (1979) Predicting 
the multiple pathways of plant succession. Environ Manage 
3:41-50 

19. Connell JH, Slatyer RO (1977) Mechanisms of succession in 
natural communities and their role in community stability 
and organization. Am Nat 111:1119-1144 

20. Glenn-Lewin DC, Peet RK, Veblin TT (eds) (1992) Plant succes¬ 
sion: theory and prediction. Chapman and Hall, London 

21. Kessell SR, Potter MW (1980) A quantitative succession model 
for nine Montana forest communities. Environ Manage 
4:227-240 

22. Weaver JE, Clements FE (1938) Plant ecology, 2nd edn. 
McGraw-Hill, New York, 520 pp 

23. McIntosh RP (1981) Succession and ecological theory. In: 
West DC, Shugart HH, Botkin DB (eds) Forest succession: con¬ 
cepts and application. Springer, New York, pp 10-23 

24. McIntosh RP (1975) H.A. Gleason - "Individualistic ecologist" 
1882-1975; His contributions to theoretical ecology. Bull 
Torrey Bot Club 102:253-273 

25. Gleason HA (1910) The vegetation of the inland sand deposits 
of Illinois. Bull Illinois State Lab Nat Hist 9:21-174 

26. Gleason HA (1927) Further views on the succession concept. 
Ecology 8:299-326 

27. Gleason HA (1917) The structure and development of the 
plant succession. Bull Torrey Bot Club 44:463-481 

28. Cowles HC (1899) The ecological relations of the vegetation 
on the sand dunes of Lake Michigan. Bot Gaz 27:95-117, 
176-202, 281-308, 361-369 

29. Cowles HC (1901) The physiographic ecology of Chicago and 
vicinity. Bot Gaz 31:73-108 

30. Adams CC (1935) The relation of general ecology to human 
ecology. Ecology 16:316-335 

31. Clements FE (1935) Experimental ecology in the public service. 
Ecology 16:342-363 

32. Transeau EN (1935) The prairie peninsula. Ecology 16:423-437 

33. Fuller GD (1935) Postglacial vegetation of the Lake Michigan 
region. Ecology 16:473-487 

34. Sears PB (1935) Types of North American pollen profiles. 
Ecology 16:488-499 



Ecological Succession and Community Dynamics 


E 


3295 


35. Smuts JC (1926) Holism and evolution. Macmillan, London 

36. Phillips J (1955) A tribute to Frederic E. Clements and his 
concepts in ecology. Ecology 35:114-115 

37. Shugart HH (1998) Terrestrial ecosystems in changing 
environments. Cambridge University Press, Cambridge, 
537 pp 

38. Watt AS (1925) On the ecology of British beech woods with 
special reference to their regeneration. II. The development 
and structure of beech communities on the Sussex Downs. 
J Ecol 13:27-73 

39. Watt AS (1947) Pattern and process in the plant community. 
J Ecol 35:1-22 

40. Odum EP (1953) Fundamentals of ecology. W. B. Saunders, 
Philadelphia, 383 pp 

41. Jordan CF (1986) Ecological effects of nuclear radiation. In: 
Orians GH (ed) Ecological knowledge and environmental 
problem solving: concepts and case studies. National 
Academy Press, Washington, DC, pp 331-344 

42. Zilversmit DB, Entenmann C, Fishier MC (1943) On the calcu¬ 
lation of turnover time and turnover rate from experiments 
involving the use of labeling agents. J Gen Physiol 26:325-331 

43. Odum EP (1969) The strategy of ecosystem development. 
Science 164:262-270 

44. Pound R (1954) Frederic E. Clements as I knew him. Ecology 
35:112-113 

45. Arthur JC (1895) Development of vegetable physiology. 
Science 44:164-184 

46. Tobey R (1976) Theoretical science and technology in 
American ecology. Technol Cult 17:461-534 

47. Tobey R (1981) Saving the prairies: the life cycle of the 
founding school of American plant ecology 1895-1955. 
University of California Press, Berkley 

48. Washington-Alien RA, West NE, Douglas Ramsey R, Phillips DK, 
Shugart HH (2010) Retrospective assessment of soil stability 
on a landscape subject to commercial grazing. Environ Monit 
Assess 160:101-121 

49. Williams E (2003) Deforesting the earth: from prehistory to 
global crisis. University of Chicago Press, Chicago, 689 pp 

50. Pretzsch H (2009) Forest dynamics, yield and growth: from 
measurement to model. Springer, Berlin 

51. Kimmins JP (1987) Forest ecology. Macmillan, Vancouver 

52. Kira T (2001) Forest and environment: an approach to global 
environmental issues. Shin-Shisosha, Tokyo 

53. Solomon S, Qin D, Manning M, Chen Z, Marquis M, Averyt KB, 
Tignor M, Miller HL (eds) (2007) Climate change 2007: the 
physical science basis. Contribution of working group I to 
the fourth assessment report of the intergovernmental panel 
on climate change. Cambridge University Press, Cambridge, 
UK/New York, 996 pp 

54. Shugart HH, Woodward FI (2011) Global change and the 
terrestrial biosphere: achievements and challenges. 
Wiley-Blackwell, Oxford, 216 pp 

55. Huston M, DeAngelis DL, Post WM (1988) New computer 
models unify ecological theory. Bioscience 38:682-691 


56. DeAngelis DL, Gross LJ (eds) (1992) Individual-based models 
and approaches in ecology: populations, communities and 
ecosystems. Chapman and Hall, New York 

57. Newnham RM (1964) The development of a stand model for 
Douglas-Fir. PhD thesis, University of British Columbia, 
Vancouver 

58. Mitchell KJ (1969) Simulation of growth of even-aged stands 
of white spruce. Yale Univ Sch Forest Bull 75:1-48 

59. Mitchell KJ (1975) Dynamics and simulated yield of Douglas- 
fir. Forest Sci Monogr 17:1-39 

60. Hegyi F (1974) A simulation model for managing jack-pine 
stands. In: Fries J (ed) Growth models for tree and stand 
simulation. Royal College of Forestry, Stockholm, pp 74-90 

61. Ek AR, Monserud RA (1974) FOREST: computer model for the 
growth and reproduction simulation for mixed species forest 
stands. Research report A2635, College of Agricultural and Life 
Sciences, University of Wisconsin, Madison 

62. Holling CS (1961) Principles of insect predation. Annu Rev 
Entomol 6:163-182 

63. Holling CS (1964) The analysis of complex population pro¬ 
cesses. Can Entomol 96:335-347 

64. Rohlf FJ, Davenport D (1969) Simulation of simple models of 
animal behavior with a digital computer. J Theor Biol 
23:400-424 

65. Botkin DB, Janak JF, Wallis JR (1972) Some ecological conse¬ 
quences of a computer model of forest growth. J Ecol 
60:849-872 

66. Shugart HH, West DC (1980) Forest succession models. 
Bioscience 30:308-313 

67. Watt AS (1925) On the ecology of British beech woods with 
special reference to their regeneration. II. The development 
and structure of beech communities on the Sussex Downs. 
J Ecol 13:27-73 

68. Bormann FH, Likens GE (1979) Pattern and process in 
a forested ecosystem. Springer, New York 

69. Bormann FH, Likens GE (1979) Catastrophic disturbance and 
the steady state in northern hardwood forests. Am Sci 
67:660-669 

70. Bonan GB, Pollard D, Thompson SL (1992) Effects of boreal 
forest vegetation on global climate. Nature 359:716-718 

71. Betts RA (2000) Offset of the potential carbon sink from boreal 
forestation by decreases in surface albedo. Nature 
408:187-190 

72. Kharuk V, Ranson K, Dvinskaya M (2007) Evidence of evergreen 
conifer invasion into larch dominated forests during 
recent decades in Central Siberia. Eurasian J Forest Res 10: 
163-171 

73. Kharuk VI, Ranson KJ, Sergey Tl, Dvinskaya ML (2009) Response 
of Pinus sibirica and Larixsibirica to climate change in southern 
Siberian alpine forest-tundra ecotone. Scand J Forest Res 
24:130-139 

74. Yan X, Shugart HH (2005) A forest gap model to simulate 
dynamics and patterns of Eastern Eurasian forests. 
J Biogeogr 32:1641-1658 


E 




3296 


E 


Ecological Systems, Introduction 


75. Shuman JK, Shugart HH (2009) Evaluating sensitivity of 
Eurasian forest biomass to climate change using a dynamic 
vegetation model. Environ Res Lett 4:045024. doi:10.1088/ 
1748-9326/4/4/045024 

76. National Climate Data Center (NCDC) (2005) TD-9813 daily and 
sub-daily precipitation for the former USSR Version 1.0. NOAA 
National Climatic Data Center, Asheville 

77. Stolbovoi V, McCallum I (eds) (2002) CD-ROM land resources of 
Russia. International Institute for Applied Systems Analysis 
and the Russian Academy of Science, Laxenburg 

78. Van Cleve K, Viereck LA (1981) Forest succession in relation to 
nutrient cycling in the boreal forest of Alaska. In: West DC, 
Shugart HH, Botkin DB (eds) Forest succession: concepts and 
application. Springer, New York, pp 185-221 


Ecological Systems, Introduction 

Rik Leemans 

Environmental Systems Analysis Group, Wageningen 
University, Wageningen, The Netherlands 

It is estimated that there are roughly 8 million animal 
species, 600,000 fungi, 300,000 plants, and an 
unestimated number of microbial species on earth. It 
is further estimated that only about 25% of the animal, 
fungi, and plant species have been identified as yet. 
Further, the interactions among these species and 
their physical environment are known to an even lesser 
degree. Meanwhile, the earth’s biota faces the prospect 
of climate change which may be either slow to manifest 
or extremely rapid, as we pass through a potential 
tipping point. In addition, human population is set to 
grow by two billion by 2045 from a present seven 
billion. This would certainly be rated as a population 
explosion, and these numbers together with expected 
(and hoped for) economic growth will stress the entire 
ecology of the earth. The question is how to cope with 
all of the above. That is actually the subject and goal of 
this entire Encyclopedia and our team of well over 
700 scientists and engineers, and also the subject of 
this section from an ecological systems viewpoint. 
Our objective in this section is to provide a significant 
portion of the scientific and engineering basis of the 
systems ecology of the earth and to provide this in 
14 detailed entries written at a level for use by univer¬ 
sity students through practicing professionals. Our 


approach is to recognize that we cannot wait for 
a complete data set for the biota before assessing and 
planning and acting to preserve the ecological balance 
of the earth. Therefore, methodology is presented 
aimed at identifying the key interactions and environ¬ 
mental effects and enabling a systems level understand¬ 
ing even at our present state of factual knowledge. 

For, ease of use by students, each entry begins with a 
glossary of terms, while at an average length of 20 print 
pages each, sufficient detail is presented for utilization 
by professionals in government, universities, and 
industry. The reader is also directed to the closely 
related sections: ► Earth System Monitoring, Intro¬ 
duction and also ► Transport and Fate of Chemicals 
in the Environment, Introduction. 

Each of the entries is summarized below. 

► Ecosystems, Adaptive Management - Adaptive 
management is an approach to natural resource man¬ 
agement that emphasizes learning through manage¬ 
ment based upon the philosophy that knowledge is 
incomplete and much of what is thought to be known 
is actually wrong, but despite uncertainty managers 
and policymakers must act. This entry includes: Struc¬ 
ture Decision Making, Participatory Active Adaptive 
Management, Adaptive Governance, and Adaptive 
Management and Law. 

► Marine Biogeochemistry - The biogeochemistry 
of the world oceans has been studied for many decades, 
and major advances in understanding have been linked 
with development of new techniques and tools that 
allow the accurate representation of various organic 
and inorganic pools within the water. It has been 
shown that some critical bioactive compounds 
(containing the elements: carbon, nitrogen, phospho¬ 
rus, oxygen) occur with particular atomic ratios to one 
another that are relatively invariant over space and time 
pools. The following cycles are presented in detail: 
carbon ocean Acidification, nitrogen and phospho¬ 
rous, iron (including iron ocean fertilization), sulfur, 
and oxygen. 

► Atmospheric Biogeochemistry - Biogeochemis¬ 
try represents the interaction of biology, chemistry, 
and geology in the Earth system. For many processes, 
an understanding of biological uptake and emission, 
chemical processing, and geological sequestration is 
necessary to resolve the sources and sinks of 
a particular constituent. For example to discover the 
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sources and sinks of atmospheric carbon dioxide, it is 
important to understand how biota take up carbon 
dioxide and chemically convert the carbon to organic 
carbon, and then how this organic carbon is used 
either to produce energy by biota or is deposited to 
the land or ocean surface and can become sequestered 
in geological formations. Often when biogeochemis¬ 
try is referred to, one refers to the nutrient cycling of 
important nutrients. Atmospheric biogeochemistry, 
as defined in its narrowest sense, is the flux of nutri¬ 
ents and pollutants important for biogeochemistry 
through the atmosphere. In its broadest sense, atmo¬ 
spheric biogeochemistry is the interaction of the 
atmosphere and atmospheric processes with the bio¬ 
geochemistry of the earth system. This broader defini¬ 
tion is used here. 

► Ecological Succession and Community Dynamics - 
Ecological succession and the dynamic change of 
communities is one of the pillars of modern ecology and 
it has considerable influence on sustainability science. The 
processes that drive succession and the regularly of 
patterns of succession have been the topic of considerable 
debate over the past century - due in no small part to the 
importance of the concept to basic and applied ecology. 
Succession is at its simplest level the pattern of change in 
ecological communities and these changes can be read on 
the landscape if one knows regional history or conversely 
can be used to interpret a landscapes history. 

► Ecosystem Engineers, Keystone Species - This 
contribution focuses on two ecological phenomena. 
The first is “keystone species” which is defined as a 
species (mostly of high trophic status) whose activities 
exert a disproportionate influence on the patterns of 
species occurrence, distribution, and density in a com¬ 
munity. The second is the concept of “ecosystem engi¬ 
neers” defined as organisms that directly or indirectly 
modulate the availability of resources (other than 
themselves) to other species by causing physical state 
changes in biotic or abiotic materials. Models have 
shown that the loss of species with a large number of 
trophic interactions can trigger high numbers of 
secondary extinctions with serious consequences for 
species persistence. The importance of keystone species 
can also be strongly linked with ecosystem engineering. 
For example, the large impact of sea otters in kelp forest 
ecosystems results from the coupling between 
engineering effects and a trophic cascade. 


► Ecosystem Services - This entry provides an 
overview of issues concerning the identification, bio¬ 
physical and economic characterization, and 
safeguarding of ecosystem services. The concept of 
ecosystem services has a long written history, reaching 
back at least as far as Plato. This review history focuses 
on recent advances such as the Millennium Ecosystem 
Assessment, and advances in spatial modeling, eco¬ 
nomic valuation, and policy implementation. Exam¬ 
ples of novel finance and policy mechanisms, including 
water funds, marine spatial planning, land-use plan¬ 
ning and human development, and global policy efforts 
are presented. The entry concludes with a discussion of 
the largest research and implementation challenges in 
this field, identifying the issues that will frame the 
future growth of the concept of ecosystem services. 

► Ecosystems and Spatial patterns - Ecological 
processes such as forest disturbances act on ecosys¬ 
tems at multiple spatial and temporal scales to generate 
complex spatial patterns. These patterns in turn influ¬ 
ence ecosystem dynamics and have important conse¬ 
quences for ecosystem sustainability. Analysis of 
ecosystem spatial structure is a first step toward 
understanding these dynamics and the uncertain 
interactions among processes. Quantitative character¬ 
ization of spatial patterns and their rates of change in 
natural environments are essential to understanding 
ecological processes and to inform sustainable man¬ 
agement techniques that aim to minimize degradation 
and alteration of ecosystem dynamics. This entry 
covers: sources of heterogeneity, spatial analysis, spa¬ 
tial analyses within the ecosystem and spatial analyses 
among ecosystems. 

► Ecosystem Flow Analysis - It is necessary to 
understand the ecosystem as a thermodynamic system 
and how the energy flows enter, interconnect, and 
disperse from the environmental system. Ecological 
network methodologies exist to investigate and analyze 
these flows. In particular, partitioning the flow into 
boundary input, non-cycled internal flow, and cycled 
internal flow shows the extent to which reuse and 
recycling arise in ecosystems. The intricate, complex 
network structures are responsible for these processes 
all within the given thermodynamic constraints. 
Design of sustainable human systems could be 
informed by these organizational patterns, in order to 
use effectively the energy available. This entry 
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demonstrates the need for flow analysis, provides a 
brief example using a well-studied ecosystem, 
and discusses some of the ecosystem development 
tendencies which can be addressed using ecosystem 
flow analysis. 

► Ecotones and Ecological Gradients - Ecotones 
are areas of steep transition between ecological 
communities, ecosystems, or ecological regions 
(such as Mediterranean and desert). Ecotones often 
occur along ecological gradients. Such gradients are 
created as a result of spatial shifts in elevation, climate, 
soil, and many other environmental factors. Ecotones 
commonly coincide with areas of sharp climatic tran¬ 
sition along environmental gradients. They occur at 
multiple spatial scales, from continental-scale transi¬ 
tions between major biomes to small-scale ecotones 
where local vegetation communities and microhabitats 
coincide. They show a diversity of boundary types that 
range from natural boundaries (e.g., altitudinal, latitu¬ 
dinal transitions) to human-generated ecotones 
(e.g., forest clear-cut edges or urban ecotones). 
Ecotones have been studied in the past four decades 
in an ecological context and in recent years are receiv¬ 
ing increasing attention in the context of biodiversity 
conservation. Various studies have shown that species 
richness and abundances tend to peak in ecotonal 
areas, though exceptions to these patterns occur. Eco¬ 
tones are “natural laboratories” for studying a range of 
evolutionary processes, such as the process by which 
new species form, also termed speciation. Recent 
evidence suggests that ecotones may serve as speciation 
centers. Some researchers argue that ecotones deserve 
high conservation investment, potentially serving as 
speciation and biodiversity centers. Because ecotones 
are often small in size and relatively rich in biodiversity, 
conservation efforts in these areas may prove to be an 
efficient and cost-effective conservation strategy. 

► Invasive Species - This entry describes the causes 
and consequences of biological invasions, by synthesiz¬ 
ing concepts from population biology, community 
ecology, evolution, biogeography, and conservation 
biology. First, the patterns and process of invasion are 
explored; then, some of its potential ecological and 
socioeconomic impacts are examined. Some major 
hypotheses and theoretical concepts explaining 
patterns of colonization and impact are presented. 
Next, management approaches to assessing, 


preventing, and mitigating this problem are consid¬ 
ered. The entry ends with a brief glimpse at some of 
the emerging issues that will likely be the foci of future 
research. 

► Landscape Ecology - Landscapes are spatially 
heterogeneous areas characterized by diverse patches 
that differ in size, shape, contents, and history. When 
spatial heterogeneity is considered, scale matters and 
hierarchy emerges. Landscape ecology is the science 
and art of studying and improving the relationship 
between spatial pattern and ecological processes on a 
multitude of scales and organizational levels. In a broad 
sense, landscape ecology represents both a field of study 
and a scientific paradigm. As a highly interdisciplinary 
and transdisciplinary enterprise, landscape ecology 
integrates biophysical and analytical approaches with 
humanistic and holistic perspectives across natural 
and social sciences. Landscape ecology was initially 
developed in Europe. With theoretical developments 
in spatial ecology and technological advances in remote 
sensing and geospatial information processing, land¬ 
scape ecology became an internationally recognized 
field of study in the 1980s. The most salient character¬ 
istics of landscape ecology are its emphasis on the 
pattern-process relationship and its focus on broad- 
scale ecological and environmental issues. Key research 
topics in landscape ecology include ecological flows in 
landscape mosaics, land use and land cover change, 
scaling, relating landscape pattern analysis with ecolog¬ 
ical processes, and landscape conservation and 
sustainability. 

► Species Competition and Predation - Competi¬ 
tion and predation are key interactions between spe¬ 
cies, and are major foci of thought and study in 
community ecology. They are believed to be major 
forces structuring natural communities, having critical 
roles in the determination of species diversity and 
species composition, and are regarded as important 
drivers of evolutionary processes. The relationships 
between the niches of different species determine how 
they interact through competition and predation, 
which then have key roles in assembly of local commu¬ 
nities, and their reassembly following perturbations. 
Niches define patterns of linkages between species, 
their resources, and their natural enemies. These pat¬ 
terns include how linkages change over time, and 
between different spatial locations, and define 
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mechanisms by which similar species are able to coexist 
by their effects on competition and predation relation¬ 
ships. The human element in the environment has 
profound effects on these phenomena. Changing the 
environment shifts interactions between species, and 
profoundly modifies the structure of food webs. In the 
modern day, there is much community reassembly, 
potentially involving major shifts in competition and 
predation. Humans transport invasive species that act 
as predators, prey, and competitors with potentially 
major effects on the community reassembly process. 

► Species Diversity Within and Among Ecosystems - 
Species diversity is a function of species richness, 
the number of species in a given locality and species 
evenness, the degree to which the relative abundances 
of species are similar. Commonly used quantification 
methods include constructing mathematical indices 
known as diversity indexes (the Shannon, Simpson, 
and Margalef indexes being the most widely used) or 
comparing observed patterns of species abundance to 
theoretical models. The scale of assessment may 
range from within a single site or habitat (known as 
a diversity) to the difference between two or more sites 
((3 diversity); which can then be combined to give 
y diversity - the diversity of the landscape. 
Early research tended to be concentrated on largely 
a diversity, but there has been a marked increase in 
studies investigating (3 and y diversity during the last 
decade. The species richness aspect of diversity is often 
considered the iconic measure of biodiversity since it is 
well defined and aligns with our intuitive sense of the 
concept. As the species level is the level at which living 
organisms are most widely known, species have become 
the major taxonomic rank for describing biodiversity. 
Along with populations, it is also the level that most 
scientists, managers, and policy makers use when refer¬ 
ring to biological diversity and it provides a useful means 
for both monitoring and priority setting. As the species 
level is one of three fundamental levels of biodiversity 
(the others are the genetic level and the ecosystem/com¬ 
munity level), species diversity is thus an important 
component of conservation. 

► Urban Ecology - Within the science of ecology, 
urban ecology is defined as the study of structure, 
dynamics, and processes in urban ecological systems. 
Urban ecology is the study of the relationships of 
human and nonhuman organisms in urban areas, the 


interactions of these organisms with the native and 
built physical environment, and the effects of these 
relationships on the fluxes of energy, materials, and 
information within individual urban systems and 
between urban and nonurban systems. Urban ecology 
applies the methods and concepts of the biological 
science of ecology to urban areas, but requires and 
integrates with the concerns, concepts, and approaches 
of social sciences to produce a hybrid discipline. Urban 
ecological systems include individual organisms, 
populations, communities, and landscapes, as well as 
buildings and infrastructure. Urban ecology further 
recognizes specific urban ecosystems as a part of 
the global biogeochemical, economic, and human 
demographic system. 
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Glossary 

Connectance The proportion of possible ecological 
interaction links between species that are realized. 
Ecosystem engineer An organism that creates or 
modifies its habitat [19]. 
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Ecosystem functioning The way ecosystems work 
related to abiotic and biotic components, such as 
chemicals, water, soil, microbes, plants, and animals. 
Keystone species A species that has a disproportionate 
effect on its environment relative to its biomass 
(Paine 1995). 

Trophic level The position a species occupies in a food 
chain. 

Definition of the Subject 

This entry focuses on two ecological phenomena. The 
first is “keystone species” which is defined by Paine 
[1] as a species (mostly of high trophic status) whose 
activities exert a disproportionate influence on the 
patterns of species occurrence, distribution, and den¬ 
sity in a community. The second is the concept of 
“ecosystem engineers” defined by Jones et al. [2] as 
organisms that directly or indirectly modulate the 
availability of resources (other than themselves) to 
other species by causing physical state changes in biotic 
or abiotic materials. 

Introduction: Keystone Species and Ecosystem 
Engineers: Analysis of Concepts 

Paine’s definition of keystone species was inspired from 
the large effects of the removal of the carnivorous 


starfish [disaster ochraceus) from intertidal habitat, 
which reduced prey species diversity due to intense 
competition from mussel prey [3], and represents 
now a classic textbook in ecology. The original keystone 
species concept of Paine [1, 4] thus identified a very 
specific mechanism: the top-down regulation of com¬ 
munity structure and diversity by a top predator 
(Fig. 1). The concept of keystone species has been 
later extended to a broader definition and now includes 
any species whose effect on ecosystems is dispropor¬ 
tionately large relative to its low biomass in the com¬ 
munity as a whole [5] . Keystone species are thus species 
which have large effects on communities and ecosys¬ 
tems through many different processes such as trophic 
interactions, pollination, or habitat modification [6, 7]. 
Examples include rabbits that can increase abundance 
and diversity of lizards [8] and sea otters whose hunt¬ 
ing in the late nineteenth century caused a population 
explosion of their sea urchin prey and consequent 
overgrazing of kelp which led to numerous extinctions 
of local species [6]. 

The concept of ecosystem engineering was pro¬ 
posed two decades later than the “keystone species” 
concept by Jones and colleagues [2]. They defined 
ecosystem engineers as “organisms that directly or 
indirectly modulate the availability of resources 
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Ecosystem Engineers, Keystone Species. Figure 1 

Two examples of keystone species impacts, (a) Effects of the removal of Pisaster on prey species diversity as a consequence 
of mussel population explosion, (b) Consequences of the removal of sea otters on species diversity due to overgrazing of 
kelp by sea urchins. Keystone species are represented in grey boxes. Small grey arrows indicate the direction of species 
abundance changes following the removal of the keystone species. The large grey arrows indicate the global 
consequences of keystone species loss on the ecosystem 
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(other than themselves) to other species by causing 
physical state changes in biotic or abiotic materials. 
In so doing they modify, maintain and/or create habi¬ 
tats” [2] . They further distinguish between two types of 
ecosystem engineers (Fig. 2): autogenic engineers that 
change the environment via their own physical struc¬ 
tures, i.e., their living and dead tissues, and allogenic 
engineers that change the environment by 
transforming living or nonliving materials from one 
physical state to another via mechanical or other 
means. The idea that organisms can have important 
effects on abiotic processes occurring in the environ¬ 
ment had been recognized before; indeed, Darwin 
devoted a whole book to the impact of earthworms 
on soil formation [9]. However, since the development 
of the concept of ecosystem engineer, engineering 
effects have been described for many organisms, from 
classic examples such as beavers, termites, or earth¬ 
worms [10-12] to mollusks [13], fish [14], caterpillars 
[15], polychaete worms [16], grasses [17, 18], 
burrowing shrimp [19], ants [20], and many other 
species (see Table 1 of [21]). 

Both the keystone species and the ecosystem 
engineer concepts point out to species which have 


important effects in ecological communities and eco¬ 
systems. Although these concepts partly overlap - an 
ecosystem engineer can be a keystone species - they 
however insist on different aspects: the keystone con¬ 
cept focuses on species which have disproportionate 
effects on community structure and ecosystem 
functioning (“outcome focused” sensu [22]) whereas 
the ecosystem engineering concept considers organ¬ 
isms which influence the abiotic environment with 
consequences on other species and related ecosystem 
processes (“process focused” sensu [22]). These differ¬ 
ences between the two concepts are reflected in the 
literature: these concepts generally appear in distinct 
studies as less than 5% of the studies on these topics 
refer to both ecosystem engineers and keystone species 
(source: ISI Web of Science). 

The keystone species concept has been strongly 
related to food web theory since its first definition 
[1, 23]. In particular, the identification of keystone 
species in food webs is an important issue. Theoretical 
studies have tried to pin down the characteristics of 
keystone species through food web models [24, 25] and 
several indices based on food web topology have been 
developed to identify keystones [26]. Models have 
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Ecosystem Engineers, Keystone Species. Figure 2 

Two examples of ecosystem engineering with kelp (a) and earthworms (b) as ecosystem engineers. Ecosystem 
engineering corresponds to changes in physical state (state 1 -2) of biotic (i.e., kelp for a) or abiotic (i.e., soil for b) materials. 
Ecosystem engineers are represented in grey boxes, (a) Case of autogenic engineering, the engineer is part of the 
new physical state (via growth here), (b) Case of allogenic engineering, the new physical state is caused by the 
engineer (x, caused via feeding here), but the engineer is not part of the new physical state 
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shown that the loss of species with a large number of 
trophic interactions can trigger high numbers of sec¬ 
ondary extinctions with serious consequences for spe¬ 
cies persistence; a result which highlights the potential 
keystone role of highly connected species in food webs 
[24, 27, 28]. 

In contrast, the concept of ecosystem engineering 
has been rarely related to food web studies. Recent 
studies acknowledge that ecosystem engineers may 
also play an important role in the network of trophic 
interactions but separating the trophic effects from the 
engineering effects to determine their relative impor¬ 
tance is difficult [20, 29-31]. 

The importance of keystone species can also be 
strongly linked with ecosystem engineering. For exam¬ 
ple, the large impact of sea otters in kelp forest ecosys¬ 
tems results from the coupling between engineering 
effects and a trophic cascade [32]. In these ecosystems, 
kelp provides habitat for many species and dampens 
wave action; the keystone effect of sea otters is thus 
mediated through their indirect trophic effect on kelp 
densities which is a main ecosystem engineer. 

Issue-1: How to Find Keystone Species and 
Ecosystem Engineers in Communities? 

Keystone species and ecosystem engineers may affect 
ecosystem processes, such as nutrient cycling, and 
thereby ecosystem functioning. In the face of rapid 
biodiversity loss, a considerable amount of studies 
were dedicated to investigate a possible link between 
species richness and ecosystem function [33] and the 
threat of diversity loss on the loss of ecosystem services 
to man. First indications show positive relationships 
between species richness and ecosystem productivity, 
stability, and sustainability, with more species being 
able to fully and complementarily run ecosystem 
functions due to niche differentiation and facilitative 
interactions (reviewed by [34]). However, there is now 
a growing consensus that functional diversity, rather 
than species numbers per se, strongly determines 
ecosystem functioning [35]. This means that the 
presence of a particular species with specific traits 
may play a larger role in determining ecosystem 
function than merely the number of species [36]. The 
apparent diversity-ecosystem function relationship can 
thus be partly caused by a greater chance of an 


influential species with particular traits being present 
in more diverse communities than in species-poor 
communities. 

If it is possible to predict and identify a priori a set of 
species traits that determine keystone interactions in 
a system, this would greatly benefit management and 
conservation purposes. Species’ traits determine how 
species contribute to ecosystem processes, so the presence 
and distribution of such traits can be utilized to indicate 
aspects of ecosystem functioning [37]. To identify key¬ 
stone species various methods have been used ranging 
from experimental removal or addition manipulations 
to comparative studies and natural history observa¬ 
tions [5] . Partly because of these methodological issues, 
identifying keystone species has so far proved elusive 
[5, 38] although some progress has been made and its 
concept now widely investigated in the context of com¬ 
plex ecological networks [25, 39-41]. Some examples 
of specific traits are for instance trophic level, body size, 
connectance, or traits concerning tolerance and resil¬ 
ience to disturbances. Organisms that influence their 
environment strongly and contribute disproportion¬ 
ately to the functioning of ecosystems often seem to 
occupy higher trophic levels in food webs [5]. Top 
predators have been described as highly interactive 
keystone species [42], have been shown to play an 
important role in stabilizing food webs [43], and play 
important roles in marine ecosystems [44] and terres¬ 
trial ecosystems [45]. 

Also, the loss of top predators has been linked to 
secondary extinctions [46,47] . This has been attributed 
to their ecological role as suppressors of medium-sized 
predators (mesopredators) (e.g., [48, 49]) and general¬ 
ist herbivores [50, 51]. In terrestrial ecosystems, organ¬ 
isms that influence their environment strongly also 
often seem to be large bodied (e.g., [52] ). Larger bodied 
organisms require a high resource and energy use per 
individual [53, 54] and have greater mobility, home 
ranges, and longevity [55, 56] and, thereby, control 
more resources over greater and coarser spatial scales 
[52, 57]. It is also proposed that well-linked and 
interacting species as key interactors are more impor¬ 
tant for the community [28, 58-62]. This approach 
characterizes the interaction structure of species placed 
in an ecological network. Among plants, on the other 
hand, some studies have shown that species within the 
same functional types but with different requirements 
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and tolerances may provide insurance to the system in 
the form of long-term resilience against changes in 
environmental factors, such as global warming, graz¬ 
ing, drought or frost [35]. 

The latter example indicates that the keystone status 
of a species often appears to be context dependent, and 
may change with successional status, productivity, diver¬ 
sity, and other ecosystem traits [63]. It is therefore 
important to identify how the importance of traits 
that define keystone species change across a gradient 
of conditions, measuring environmental factors, com¬ 
munity composition, trophic dynamics, and distribu¬ 
tion of strong and weak links in the community (e.g., 
[24]). Without droughts, a specific plant species may 
not play an important role in maintaining community 
composition or ecosystem functioning. The Australian 
brushtail possum (Trichosurus vulpecula) may function 
as a keystone species in rata-kamahi forests by 
defoliating and killing canopy trees, but not in beech- 
dominated forests where floristic composition, but not 
forest structure, is typically affected [64-66]. A species 
in its native grounds may play no specific role in the 
system, whereas an invasive species may have devastat¬ 
ing effects in the system it got introduced into, e.g., 
feral cats and rats on islands [67], Alewife (Alosa 
pseudoharengus ) in nonnative freshwater lakes and 
ponds [68], and Cheatgrass (Bromus tectorum) in 
nonnative grasslands [69]. 

Identifying keystone species therefore is not with¬ 
out its problems. It is also important to notice that 
ecologically important species might not necessarily 
be the ones that are also considered important by 
traditional conservationists (i.e., rare species; [70]). 

Issue-2: Usefulness for Management 

Because of the limited resources available in compar¬ 
ison to conservation needs, it has been proposed to 
design protection of single species in the aim of 
indirectly protecting the regional biota. These 
“surrogate species” are roughly of three categories 
[71]: (1) flagships, charismatic species that attract 
public support; (2) umbrellas, species requiring such 
habitats that their protection might protect other spe¬ 
cies; and (3) biodiversity indicators, taxa whose pres¬ 
ence may indicate high species richness. However 
the effectiveness of these policies has been questioned 


and [70] suggested that single-species management 
might be more effective when directed toward key¬ 
stone species. Indeed, the importance of keystone 
species and ecosystem engineers in communities 
make these species particularly important conservation 
targets, since the loss of these species can affect entire 
communities and ecosystems [72]. However, the main 
difficulty for applying these concepts to conservation 
issues lays on both the identification of keystone 
species and ecosystem engineers in communities and 
on the context dependence of their impacts, as 
discussed in the previous section. Thus, although 
these concepts appear relevant for conservation poli¬ 
cies, it is still a long way from providing general and 
practical recommendations for conservationists and 
managers [71]. 

The concepts of keystone species and of ecosystem 
engineers could also be useful for other management 
issues in natural and anthropized ecosystems, such as 
for ecosystem restoration or agriculture. For example, 
in agro-ecosystems, several well-known ecosystem 
engineers have been used to improve soil fertility and 
crop yield. In some countries, farmers make use of 
the soil fertilizing effect of termites by spreading ter¬ 
mite mound soil in their field [73]. Similarly, earth¬ 
worm inoculation has generally positive effects on crop 
yield [74]. 

Future Directions 

The notions of ecosystem engineers and keystone spe¬ 
cies have been playing prominent roles in ecology for 
several decades, still many questions and uncertainties 
ask for further investigations. Three of them are briefly 
described here. The first is how keystone roles and 
engineering effects are related to body-size: Do larger 
organisms have larger effects than smaller organisms? 
The second regards the context dependence of keystone 
roles and engineering effects: As the composition and 
structure of ecological communities are dynamic both 
in terms of species composition and species abun¬ 
dances, what does that imply for the role species have 
in communities and ecosystem functioning? The third 
line of research might be the most relevant for our 
society: How can the concepts for nature conservation, 
biodiversity protection, and the enhancement of 
environmental quality be used? 
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Glossary 

Consumer Heterotrophic organism that consumes 
other organisms for their energy requirements. 
Cycling The process by which energy or matter returns 
from its compartment of origin before exiting the 
system boundary. 

Ecological goal function Tendency observed in the 
orientation or directional development of ecologi¬ 
cal systems. 


Flow The transfer of energy or matter from one 
compartment in the system to another by active 
(feeding) or passive (death, egestion) means. 
Network analysis A mathematical tool to study 
objects as part of a connected system and to identify 
and quantify the direct and indirect effects in that 
system. 

Primary producer Photosynthesizing organism that 
captures external energy sources and brings it into 
the system as the basis for all subsequent thermo¬ 
dynamic activity. 

Thermodynamic system A bounded system defined 
by the quantities of energy and matter flowing 
through it. 

Definition 

Ecosystem services provide the basis for all human 
activity. Maintaining their sustained function is of crit¬ 
ical concern to the issues of sustainability addressed 
here in this encyclopedia. At root, the ecosystem is 
a thermodynamic system receiving, collecting, 
transforming, and dissipating solar energy. The energy 
pathways are varied and complex and lead to the diver¬ 
sity of form and services available on the earth. There¬ 
fore, it is necessary to understand the ecosystem as 
a thermodynamic system and how the energy flows 
enter, interconnect, and disperse from the environmen¬ 
tal system. Ecological network methodologies exist to 
investigate and analyze these flows. In particular, 
partitioning the flow into boundary input, noncycled 
internal flow, and cycled internal flow shows the extent 
to which reuse and recycling arise in ecosystems. The 
intricate, complex network structures are responsible 
for these processes all within the given thermodynamic 
constraints. Design of sustainable human systems 
could be informed by these organizational patterns, in 
order to use effectively the energy available. This article 
demonstrates the need for flow analysis, provides 
a brief example using a well-studied ecosystem, and 
discusses some of the ecosystem development tenden¬ 
cies which can be addressed using ecosystem flow 
analysis. 

Introduction 

Ecosystems, like all environmental systems, are open, 
thermodynamic systems (Fig. 1). They take in energy 
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Input-State-Output 

Ecosystem Flow Analysis. Figure 1 

Environmental systems are open systems, connecting to 
the environment through inputs of high-quality energy 
and discarding low-quality outputs 


from an external energy source - almost entirely from 
solar energy, although geothermal or geochemical 
energy drives some systems. Ecosystem structure is 
built with the energy, and then the degraded energy is 
passed back to the environment. For some time period 
the “stored solar energy” persists in the forms perceived 
on the earth’s surface as biomass stores of living and 
nonliving organic material - such as all of us. In Frank 
Herbert’s novel Dune , he envisioned that on an arid 
planet an important functional role of each individual 
was “carrying” water in oneself. In a world dominated 
by thermodynamic constraints, such as ours, everyone 
is an energy carrier. These stores are temporary and 
fluid. In this perspective, one could substitute the cur¬ 
rent object-mode paradigm with a flow-based, 
rheomode. In other words, all stocks are flows. The 
paper that one writes on now is only an ephemeral 
stage of the energy which started in the solar reactions, 
traveled to earth, was captured by a photosynthesizing 
organism, converted into storage in the xylem of that 
organism, and harvested and transformed into the use- 
able product that one now holds. But that is not the end. 
Over time, the paper will slowly degrade or decompose, 
or perhaps the energy release will be sudden through 
combustion. In any case, the objects held are transitory 
states in a long-term dynamic from (energy) source to 
(energy) sink. Changing this view would help in appre¬ 
ciating better the difference between capital and income, 
because, for example, harvesting natural capital stock 
(i.e., a forest) into a flow (deforestation) is not equiva¬ 
lent yet treated as substitutable in the current accounts. 
The rheomode approach could help focus clearly on the 
difference and thus sustainability of stocks and flows. 


Concerned about sustainability over a human time 
horizon, one must be aware of the constraints imposed 
by these sources and sinks. From the input side, clearly, 
it is necessary that one does not extract resources at 
a rate faster than they can regenerate. And concerning 
the output, the waste emissions should not exceed to 
the assimilative capacity of the local environment [ 1 ]. 
These are the most basic constraints imposed by open- 
system, thermodynamics. Humans have transformed 
the earth’s surface to maximize the capture of photosyn¬ 
thetic energy - think of the millennia over which the 
Chinese, Romans, Babylonians, etc., have manipulated 
and manicured the landscape for agricultural produc¬ 
tion. Still, these societies rose and fell within the solar 
energy domain. These societies collapsed if they 
overconsumed the base resources or if they polluted 
their local environments [2] . In addition to these persis¬ 
tent input-output constraints, there is a third sustain¬ 
ability consideration currently observed in the 
anthropocene, in that it is not only the input-output 
relations, but also the structure which is created. Modern 
infrastructure demands the continual input of high- 
quality (low entropy) energy of a form not naturally 
delivered by ecosystem services. Furthermore, the cre¬ 
ated structure locks us into the necessity of immense 
energy flows for maintenance. Perhaps an apt analogy 
can be offered through the Greek myth of Erysichthon, 
who was King of Thessaly. He angered the gods by 
cutting down a sacred tree, and as punishment was 
insatiably hungry. Importantly, the more he ate, the 
hungrier he became. Our infrastructure, like 
Erysichthon, does not sit idle but continually demands 
upkeep such that the more structure, the more resources 
are needed to support this structure. It is not just about 
the present flows, but also the life cycle debt commit¬ 
ment as a result of the structure. Today that energy debt 
is paid almost entirely in the usage of fossil fuels - 
a nonrenewable resource. The scale of human activity 
seen today is because the application of fossil fuels to 
substitute solar fuels has released humans from one of 
the long-standing constraints on growth. And as a result, 
humans have exploded across the landscape. This growth 
was easy when there was sufficient energy to add to the 
system. In fact, the first growth form is boundary growth, 
taking energy into the system, and storing it as bio¬ 
mass. As long as there is more energy available, the 
system growth can occur unbound. The second stage 
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of growth, network and information growth , is squeez¬ 
ing more utility out of the available source by coupling 
processes and improving efficiencies [3]. This can 
occur in parallel with boundary growth, but becomes 
a necessity when boundary growth is limited. In human 
systems, those immediate constraints are looming at 
least given the current fuel-mix options. 

Natural ecosystems, dependent on the solar energy 
flows developed extremely complex and beautiful 
structures within these thermodynamic constraints. 
And it is a useful guide to learn from these systems 
as a more eco-friendly design is incorporated. Below, 
some of the ecosystem flow analysis methodologies are 
explored and applied. 

Investigation of Ecosystem Flows Using 
Network Analysis 

“There are no trash cans in nature.” This is a useful 
phrase reminding that waste from one entity is food/ 
input for another. Energy, of course, has a higher 
dissipative factor in the reuse than material cycles 
such as nitrogen, phosphorus, and calcium, but still 
there is a complex network of pathways designed to 
utilize the energy available in natural ecosystems. In 
1973, Hannon [4] introduced Leontief’s input-output 
methodology into ecology, applying it to the energy 
flow structure of an ecosystem. The ecosystem is 
represented by n compartments, and the energy 
flowing into compartments, within compartments 
and exiting compartments. 

A network flow model is essentially an ecological 
food web (energy-matter flow of who eats whom), 
which also includes energy input, and nonfeeding 
pathways such as dissipative export out of the system 
and pathways to detritus. The first step is to identify the 
system of interest and place a boundary (real or con¬ 
ceptual) around it. Energy-matter transfers within the 
system boundary comprise the network; transfers 
crossing the boundary are either input or output to 
the network, and all transactions starting and ending 
outside the boundary without crossing it are external to 
the system and are not considered. The energy inflows 
and intra-system flows can be considered the produc¬ 
tion energy flow and the flows with no consumers such 
as metabolic energy and exported biomass are the 
respiration energy flow [5]. 


The data required for ecological network analysis 
are as follows: for each compartment in the network, 
the biomass and physiological parameters, such as con¬ 
sumption (C), production (P), respiration (R), and 
egestion (E), must be quantified [6]. Furthermore, the 
diet of each compartment must be apportioned 
amongst the inputs from other compartments (con¬ 
sumption) in the network. This apportionment of 
“who eats whom and by how much” can be depicted 
in a dietary flow matrix , F, where energy flows from 
column elements j to row elements i. For all compart¬ 
ments, inputs should balance outputs (C = P + R + E) 
in accordance with the conservation of matter and the 
laws of thermodynamics. 

The sum of the flow matrix elements, fp gives the 
total inflow to compartmental i such that: 

n 

Tijn = ^ ^ fij H - Zj 

M 

where z x - is the boundary flow into i. The outflow from 
i can be expressed as: 

n 

Ti,out = Yj p + yi 

j =1 

where is the boundary outflow from i. At steady state, 
a necessary condition for the network flow analysis, 
T i>in = T iyOUt and one compartmental throughflow 
vector can be written as T = (Tf. The total system 
throughflow (TST) is given by the sum of the compart¬ 
mental throughflows: 

n 

TST = Ti 

j=i 

The motivation for flow partitioning begins with 
nondimensional flow intensities (i.e., throughflow- 
specific flows) which result when flows are divided 
by throughflows of originating compartments: gp = 
fj/Tp The elements of matrix G = (gif) give the dimen¬ 
sionless transfer efficiencies corresponding to each 
direct flow, fp Powers G m of this matrix give the indi¬ 
rect flow intensities associated with paths of lengths 
m = 2, 3, .... Due to dissipation, flow along these 
indirect paths approaches zero as m —> oc so that 
the power series Ylm=o representing the sum 



Ecosystem Flow Analysis 


E 


3309 


of the initial, direct, and indirect flows converges to an 
integral flow intensity matrix, N: 

^ = ^ + ^G + G 2 +G 3 + +G m + ... 

integral initial direct indirect 

= {\-G)~ l 

N maps the steady-state input vector z into the 
steady-state system throughflow vector: 

T = Nz = (I + G + G 2 + G 3 + ... + G m + ...)z 

Term by term, flow intensities G m of different 
orders m are propagated over paths of different lengths 
m. The first term, I, brings the input vector z across the 
system boundary as input Zj to each initiating compart¬ 
ment, j. The second term, G, produces the first-order 
direct transfers from each j to each i in the system. The 
remaining terms where m > 1 define m th order indirect 
flows associated with length m paths. As stated before, 
these go to zero in the limit as m —> oo, which is 
necessary for series convergence. This demonstrates 
that each “direct” flow fj at steady state is actually 
composed of flow elements of all orders, m = 1,2, .... 
In fact, a major result of this flow analysis is that 
indirect flows can dominate direct flows: 
J2m=2 G m > G. In the above developments F, T, z, 
and y represent matter or energy fluxes, and G and N 
are dimensionless intensive flows. 

Finn [7] developed a cycling index using this basic 
approach and, Higashi et al. [8] described a three-mode 
partition of the flows, expanded by Fath et al. [9] into 
five modes (Table 1). Mode 0 is the boundary input into 
the system. Mode 1 accounts for all flow in which sub¬ 
stance moves from node j to a terminal node i for the 
first time only without cycling. Mode 2 is flow cycled at 
terminal nodes i of each (i, j) pair. Mode 3 is component¬ 
wise dissipative flow in the sense that it exits from node 
i never to return again to i. Mode 4 is the boundary 
output from i constituting systemically dissipative flows 
exiting the system (Fig. 2). by is the Kronecker delta 
defined by by - 1 for i = j and by = 0 for i ^ j. 

Note, the symmetry in that quantitatively Mode 
0 = Mode 4, and Mode 1 = Mode 3. This is due to the 
conservation of mass/energy and at steady state what 
comes in must go out. Mode 2 represents the cycled 
flow which has additional impact on the system by 
staying in the system longer, increasing the residence 


Ecosystem Flow Analysis. Table 1 Network representa¬ 
tion of flow partitioning into five modes for any (i, j) pair in 
a system 
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Mode 3 
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time, and returning to its source of emanation. There¬ 
fore, total system throughflow can be written as: 

tst =/ (0) +/ (1) +/ (2 ) =/ (2) +/ (3) +/ (4) 


And, on a nodal basis, throughflow is: 


**=#+#+# = ' 


Zjbij -h — - 8ij\zj H— L (n u 




\)Zj 


The mode partition designation clearly shows the 
contribution of flow within the entire system of 
interactions. 


Example: Cone Spring Ecosystem Model 


A classic example is the Cone Spring ecosystem model 
developed by Tilly [10]. In this model, there are five 
compartments representing: (1) plants, (2) bacteria, 
(3) detritivores, (4) carnivores, and (5) detritus 
(Fig. 3). There are 2 external inputs (to plants and 
detritus), 8 internal flows, and each compartment has 
boundary outflow representing metabolic or egestion 
losses. The internal flows from columns j to rows i are 
given by: 


F = 


0 

8881 

0 

0 

0 


0 

0 

5205 

2309 

0 


0 0 

1600 200 

0 0 

75 0 

0 370 


0 

167 

0 

0 

0 



















3310 


E 


Ecosystem Flow Analysis 


System boundary 



mode 1 
first passage 


f(0)+f(1 )+f(2)=f(2)_|_f(3)_|_f(4) 


mode 3 
dissipation 


Ecosystem Flow Analysis. Figure 2 

Schematic of flow partitioning for a central node i, in relation to other network compartments. Flow reaches the node 
directly across the boundary, f (0) , by passing through other compartments before reaching i, f (1) , and leaving i to cycle 
back again, f (2) . Outflows symmetrically mirror these inputs 
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Network diagram of Cone Spring ecosystem energy flows [10]. All flows are in kcal/m 2 /year. Biomasses are in kcal/m 2 . 
Green arrows are exogenous boundary inflows. Black arrows are exports of useable energy. Red ground symbols represent 
metabolic energy loss 
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Compartmental throughflows are: T = [11 184, 11 
484, 5 204, 2 384, 370] and TST = 30 627. 

The nondimensional flow fractions are given by: 



0 

0 

0 

0 

0 


0.794 

0 

0.308 

0.0840 

0.45] 

G = 

0 

0.453 

0 

0 

0 


0 

0.201 

0.014 

0 

0 


0 

0 

0 

0.155 

0 

And the integral flow 

matrix 

is: 



1 

0 

0 

0 

0 


0.958 

1.207 

0.374 

0.186 

0.545 

N = 

0.434 

0.547 

1.169 

0.084 

0.247 


0.199 

0.251 

0.092 

1.039 

0.113 


0.031 

0.039 

0.014 

0.161 

1.018 


Looking at Table 2, it is seen that over 38% of the 
flow comes directly to a node from the first instant 
across a system boundary and 52% of the flow origi¬ 
nates from one compartment and enters another com¬ 
partment for the first time without cycling. Slightly 
over 9% of the total energy flow is material that has 
cycled by exiting and reentering the same compart¬ 
mental node. In other words, about 3,000 kcal/m 2 /y of 
the total system throughflow is comprised of energy 
due to cyclic pathways which retain the energy in the 
system. This additional boost is important to the over¬ 
all function of the Cone Spring ecosystem. A noticeable 
contribution of cycled flow is a common phenomenon 
in all ecosystems. Another way to demonstrate this 
importance of cycling, and the fundamental shift it 
has on how an ecosystem should be viewed, was given 
by Braner [11]. While investigating the same five- 
compartment Cone Spring model above, he showed 


Ecosystem Flow Analysis. Table 2 Results of the flow 
analysis partitioning for Cone Spring ecosystem. Values 
represent the fraction of total system throughflow for each 
mode 


Mode 0 

0.386 

Mode 1 

0.522 

Mode 2 

0.092 

Mode 3 

0.522 

Mode 4 

0.386 


that cyclic pathways identified by flow analysis 
reveal that the original boundary flow persists in the 
system much longer than obviously apparent. For 
contrast, in a five-compartment food chain model - 
a type often used, incorrectly, to represent an ecosys¬ 
tem - the longest path could only be four steps 
in length from XI —* X2 —> X3 —► X4 —► X5. Real 
ecosystems have more complex structures with cycles. 
After those four steps, the original flow from compart¬ 
ment 1 would exit the system at compartment 5. 
According to Braner, more than 10% of the flow 
remains in the Cone Spring ecosystem after four 
steps. In fact, approximately 1% of the original flow 
remains after 9 steps and 0.001% is left after 15 steps 
(Fig. 4). A similar result is shown for two other eco¬ 
systems in the same figure. Therefore, the cycles, evi¬ 
dent from flow analysis, play a very important role in 
the system having enough resource to function and 
provide ecosystem services. 

Ecosystem Goal Functions 

Flow analysis has another useful feature related to 
understanding ecosystem dynamics. Odum [12] pro¬ 
posed 24 different attributes which describe the ecosys¬ 
tem development, for aspects such as community 



Path Length 


Ecosystem Flow Analysis. Figure 4 

Portion of original energy that remains as path length 
increases (From Braner [11], reprinted with permission). 
There is still noticeable energy in the system after 10 steps 
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energetics, nutrient dynamics, and overall homeostasis. 
The attributes dealing with energetics, which change 
during the ecosystem development, have also been 
formulated as ecological goal functions - which 
describe observable macroscopic patterns over time. 
They are not strict goal functions in the sense of math¬ 
ematical optimization models (neither is economic 
utility although it is used as such), but indicate the 
tendency for ecosystems to follow during development, 
for example, during succession from early r-selected 
species and r-selecting environments to late K-selected 
species and K-selecting environments. Some of the 
more common goal functions employed include: 
maximum power [13], maximum dissipation [14], 
maximum cycling [15], maximum residence time 
[16], minimum specific dissipation [17,18], maximum 
energy [19], and maximum ascendency [20] (see [9], 
for a detailed description of these). The idea is that the 
ecological network self-organizes itself in a way that 
leads to directional change in the property of these 
values. For example, maximum power, interpreted to 
mean the maximum throughflow in the network is 
given by: TST = / 0) + / (1) + / (2) . Therefore, TST 
increases when there is more boundary flow (mode 0), 
more first passage flow (mode 1), or more cycled flow 
(mode 2). The mechanisms for this to increase practi¬ 
cally relate to the system’s ability to capture more 
boundary flow by increasing the uptake. Both first 
passage flow and cycled flow also depend on the second 
stage of growth exemplified by the structure of the 
network and the efficiency of flows along each connec¬ 
tion. Similar rationale can be made for the other goal 
functions listed above, and in fact it has been shown 
that the goal functions are complementary and mutu¬ 
ally reinforcing in that the realization of one generally 
promotes the others. Together they provide a holistic 
view of ecosystem development through different ther¬ 
modynamic perspectives. Again, the value of this eco¬ 
system knowledge is obvious for application to design 
and to manage human systems sustainably. If ecosys¬ 
tem services are required, then the inherent dynamics 
of the systems used should be better understood. 
Human activities in line with these directions will be 
supported by natural processes, those that do not will 
experience additional resistance and therefore addi¬ 
tional cost and difficulty. Humans are better off work¬ 
ing with nature than against it if possible. 


Conclusions and Future Directions 

Ecosystem flow analysis clearly shows that the distri¬ 
bution of energy flow in a network is not simple. Some 
significant fraction of the energy remains in the system 
and cycles before exiting the system. This insight 
was evident in R. Lindeman’s [21] seminal work on 
Cedar Bog Lake in which he referred to his eight- 
compartment ecosystem as a “food-cycle.” Unfortu¬ 
nately, he did not have the quantitative tools at his 
disposal, like flow analysis, and to simplify the calcula¬ 
tions, proceeded to analyze the system according to two 
distinct “food-chains,” although in reality they are 
linked and contain cycles. Further work in this area 
also neglected the presence and significance of 
food cycles until research in the mid-1970s (such 
as [22-25], and others) when network analysis tech¬ 
niques developed sufficiently to provide a holistic 
investigation of the ecosystem function. As stated 
above, this changes the way one must look at ecosys¬ 
tems, as processors and stores of energy flow. 
The energy does not pass quickly through but 
can remain and impact the system indirectly. The 
good news is that this flow which remains in the 
system is able to positively drive ecosystem processes 
and contribute to the overall sustainability of the 
system. 

However, the lesson to take is that in the design of 
human systems, industrial processes are built sequen¬ 
tially, which have raw material —> processing —> 
product —> disposal. There is little room for cycling 
and reuse. Remember, there are no trash cans in nature. 
Everything has a use and reuse. Efforts are now seen in 
industrial ecology promoting closed loop engineering 
and cradle-to-cradle considerations, but there is a long 
way to go, as evidenced by the massive amounts of raw 
material input and solid waste generated on a daily 
basis by human activity. Also, the flow analysis must 
include all parts and processes of the holistic integrated 
socio-ecological system. Future work is needed to con¬ 
tinue to understand energy cycles in natural systems 
and furthermore, how to implement lessons from these 
into the design of socio-ecological systems. Ecosystem 
flow analysis clearly shows the input-output orienta¬ 
tion flow resources have at their disposal for 
maintaining functional activity and can aid in sustain¬ 
ability science. 
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Glossary 

Ecosystem services The wide array of benefits that 
ecosystems, and their biodiversity, confer on 
humanity. 

Marginal value The economic value of the next incre¬ 
mental unit of something. In this context, marginal 
values are those associated with managing the next 
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small unit of an ecosystem in a particular way (e.g., 
preserving, rather than clearing, the next unit of 
forest). They can also be the partial contribution of 
natural capital to a final good that is produced with 
other inputs. For example, the marginal value of 
irrigation water for crop production is the value of 
the incremental crop yield that can be attributed to 
irrigation, rather than to labor, fertilizer, and other 
inputs. 

Natural capital Here we focus on living, renewable 
forms of natural capital, which constitute a stock - 
of an ecosystem and the biota that makes it up - that 
generates a flow of ecosystem services. For example, 
a forest constitutes a stock that generates a flow of 
timber, carbon sequestration, water quality, biodi¬ 
versity, serenity, and other benefits, depending upon 
how it is managed. (Fossil fuels and other minerals 
constitute nonliving natural capital, which is gener¬ 
ally nonrenewable on time scales of interest to 
society.) 

Definition 

Ecosystem services are essential to sustaining and ful¬ 
filling human life, and yet their supply is seriously 
threatened by the intensification of human impacts 
on the environment. Over the past decade, efforts to 
value and protect ecosystem services have been pro¬ 
moted by many as the last best hope for making con¬ 
servation mainstream - attractive and commonplace 
worldwide. In theory, if institutions recognize the 
values of nature, then we can greatly enhance invest¬ 
ments in conservation and foster human well-being at 
the same time. In practice, scientific and policy com¬ 
munities have not yet developed the scientific basis or 
the policy and finance mechanisms for integrating nat¬ 
ural capital into resource and land-use decisions on 
a large scale. 

Introduction 

This entry provides an overview of issues concerning 
the identification, biophysical and economic character¬ 
ization, and safeguarding of ecosystem services. The 
concept of ecosystem services has a long written his¬ 
tory, reaching back at least as far as Plato. We review 
this history, including a focus on recent advances such 


as the Millennium Ecosystem Assessment, and 
advances in spatial modeling, economic valuation, 
and policy implementation. We provide examples of 
novel finance and policy mechanisms, including water 
funds, marine spatial planning, land-use planning and 
human development, and global policy efforts. We 
conclude with a discussion of the largest research and 
implementation challenges in this field, identifying the 
issues that will frame the future growth of the concept 
of ecosystem services. 

What are Ecosystem Services? 

Definition and Classification 

Ecosystem services are defined simply as the benefits 
that people obtain from ecosystems [38]. They sustain 
and fulfill human life and flow from many conditions 
and processes of ecosystems, and the species making 
them up [14]. The processes and features generating 
ecosystem services (ES) are so tightly interconnected 
that any classification is inherently somewhat arbitrary. 
The most widely used classification was developed 
through the Millennium Ecosystem Assessment (MA) 
and identifies four classes of ES based on their types of 
benefits to society: 

1. Provisioning services including the production of 
goods such as food, water, timber, and fiber 

2. Regulating services that stabilize climate, moderate 
risk of flooding and disease, and protect or enhance 
water quality 

3. Cultural services that provide recreational, aesthetic, 
educational, community, and spiritual opportunities 

4. Supporting services that underlie provision of the 
other three classes of benefits, including soil forma¬ 
tion, photosynthesis, nutrient cycling, and the pres¬ 
ervation of options (Fig. 1; [38]). 

The classification of ES is still a topic of debate and 
several other classification approaches have been 
suggested [7, 19, 22, 67]. 

System- and scale-neutral, the ecosystem services 
framework applies equally to terrestrial, freshwater, 
and marine ecosystems and their processes, anywhere 
on the spectrum from relatively pristine to heavily 
managed conditions. Indeed, all ecosystems provide, 
to differing degrees, a set of ES. Human conversion of 
ecosystems from one type to another is often motivated 
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ECOSYSTEM SERVICES 

Provisioning 

FOOD 

FRESH WATER 
WOOD AND FIBER 
FUEL 


Supporting 

NUTRIENT CYCLING 
SOIL FORMATION 
PRIMARY PRODUCTION 


Regulating 

CLIMATE REGULATION 
FLOOD REGULATION 
DISEASE REGULATION 
WATER PURIFICATION 


Cultural 

AESTHETIC 

SPIRITUAL 

EDUCATIONAL 

RECREATIONAL 


LIFE ON EARTH - BIODIVERSITY 



CONSTITUENTS OF WELL-BEING 


Security 


PERSONAL SAFETY 

SECURE RESOURCE ACCESS 
SECURITY FROM DISASTERS 


Basic material 
for good life 

Freedom 

ADEQUATE LIVELIHOODS 

of choice 

SUFFICIENT NUTRITIOUS FOOD 

and action 

SHELTER 

ACCESS TO GOODS 

OPPORTUNITY TO BE 
ABLE TO ACHIEVE 


WHAT AN INDIVIDUAL 

Health 

STRENGTH 

VALUES doing 

AND BEING 

feeling well 

ACCESS TO CLEAN AIR 

AND WATER 


Good social relations 


SOCIAL COHESION 

MUTUAL RESPECT 

ABILITY TO HELP OTHERS 



Source: Millennium Ecosystem Assessment 
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Ecosystem Services. Figure 1 

Ecosystem service categories and their linkages to human well-being as described in the Millennium Ecosystem 
Assessment [38] 


by a desire for a different set of ES, though consider¬ 
ation of the services of the two systems and their 
tradeoffs is often incomplete. 

Ecosystem Services Across Systems 

A wide range of ES is generated in the terrestrial realm, 
by croplands, natural and managed forests, grasslands, 
and wetlands. In each of these systems, for example, 
vegetation can protect and enhance soils, preventing 
their loss through erosion and improving fertility by 
retaining moisture and storing and recycling nutrients. 
Vegetation and soils together regulate the quantity, 
quality, and timing of water flows, thus moderating 
floods and droughts and providing cleaner, more reli¬ 
able supplies [8]. 


Forests stand out as important in regulating water 
and carbon cycles [30]; in their strong influence on 
local, regional, and global climate [30, 49]; and because 
of the multiple, interacting threats to their future 
(e.g., [44, 66]). They also provide natural products 
for subsistence use or sale including timber, firewood, 
mushrooms, fruits and seeds, medicinal plants, rub¬ 
ber, cork, and bushmeat. Forest and woodland habi¬ 
tats harbor species that provide pollination and pest 
control to commercial or subsistence crops. Grassland 
and other dryland systems play these same critical 
roles in addition to supporting vast livestock 
populations [38]. Wetlands occupy a small fraction of 
Earth’s surface, but dominate the landscape where they 
are concentrated and provide a wide array of water 
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quality, flood mitigation, coastal protection, and 
biogeochemical services [38]. Each of these systems, 
however natural or managed, can provide habitat 
for biodiversity and opportunities for recreational 
activities, spiritual experiences, and creative, cultural 
expression. 

Freshwater ecosystems provide a suite of highly 
visible and widely appreciated ES [53]. The freshwater 
regulated by terrestrial systems and the atmosphere is 
used for drinking, hydropower production, irrigation, 
household activities (washing, etc.), industrial pur¬ 
poses (cooling, manufacturing, etc.), and cultural 
experiences. People also gain large revenues and nutri¬ 
tion from freshwater fisheries and aquaculture. Less 
appreciated is the value of sediment transport and 
deposition in rivers that supply river reaches and 
downstream beaches with important sand and gravel 
resources. Wetlands and other aquatic vegetation can 
regulate flood waters and cycle nutrients, improving 
water quality. Finally, freshwater systems serve as path¬ 
ways for human transportation and recreational or 
cultural activities. 

Marine ecosystems also provide all four classes of 
ecosystem services described in the MA. Marine fish¬ 
eries and aquaculture provide nutrition, feed for ani¬ 
mals, livelihoods, and important recreational and 
cultural opportunities. Harvests of other species for 
food additives, cosmetics, and pharmaceuticals also 
support health, nutrition, and livelihoods. Marine 
biogenic habitats (such as coral reefs, oyster reefs, 
and kelp forests) regulate natural hazards including 
storm surges, and may play a critical role in helping 
coastal communities adapt to sea level rise. Marine 
systems also transform, detoxify, and sequester wastes. 
In addition, oceans are the center of the global water 
cycle; they hold 96.5% of the Earth’s water [24] and are 
a primary driver of the atmosphere’s temperature, 
moisture content, and stability [12]. Oceans are also 
key players in the global cycles of carbon, nitrogen, 
oxygen, phosphorus, sulfur, and other major elements 
[51] and are responsible for approximately 40% of 
global net primary productivity [39, 61]. Finally, 
coastal communities reap many benefits from coastal 
tourism (one the one of world’s most profitable indus¬ 
tries [65]), and numerous coastal communities define 
their very identities in relation to the sea and all it 
brings. 


The Ecosystem Service Supply Chain 

Ecosystem services flow to people along a supply chain 
from biophysical systems to people [63]. All services 
are generated by some function or element of a natural 
or managed system (Fig. 2). The full suite of these 
elements or functions can best be considered in three 
discrete steps: supply, service, and benefit. For example, 
consider protection from coastal storm surges. Many 
different types of coastal elements (e.g., coral reefs, 
mangroves, oyster beds, barrier islands) confer protec¬ 
tion from storm surges by attenuating waves. The full 
set of locations of these coastal elements represents the 
supply of protection from storms. People do not 
receive storm surge protection from all of these loca¬ 
tions, however, because some are far from human 
infrastructure and settlement. Both the distribution of 
human infrastructure and settlement, together with the 
location and condition of supply, are required to give 
a clear picture of how much “service” is actually deliv¬ 
ered at a given time. Finally, the service delivered to 
human communities is often valued differently, 
depending on the context. For example, coastal protec¬ 
tion services provided by nearshore habitats to easily 



Ecosystem Services. Figure 2 

Three measurement points for ecosystem services [63]. 
Supply metrics deal only with the biophysical system 
underpinning the service of interest. Service metrics 
include critical information linking supply to beneficiaries. 
Benefit metrics weigh the level of service based on people's 
preferences or social policy goals 
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accessible, popular, public beaches might be seen as 
more valuable, or providing greater benefit, than 
those to more remote sites. 

History of the Concept of Ecosystem Services 

It is primarily through disruption and loss that the 
nature and value of ecosystem services has been illumi¬ 
nated. For instance, deforestation has demonstrated 
the critical role of forests in the hydrological cycle - in 
particular, in mitigating floods, droughts, the erosive 
forces of wind and rain, and the silting of dams and 
irrigation canals. Release of toxic substances, whether 
accidental or deliberate, has revealed the nature and 
value of physical and chemical processes, governed in 
part by a diversity of microorganisms, that disperse and 
break down hazardous materials. Thinning of the 
stratospheric ozone layer sharpened awareness of the 
value of its service in screening out harmful ultraviolet 
radiation. And the loss of coastal wetlands has brought 
into relief their importance in regulating coastal haz¬ 
ards such as hurricanes and tsunamis. 

Initial Development of the Ecosystem Services 
Concept 

A cognizance of ecosystem services, expressed in terms 
of their loss, dates back at least to Plato and probably 
much earlier: 

► What now remains of the formerly rich land is like the 
skeleton of a sick man with all the fat and soft earth 
having wasted away and only the bare framework 
remaining. Formerly, many of the mountains were ara¬ 
ble. The plains that were full of rich soil are now 
marshes. Hills that were once covered with forests 
and produced abundant pasture now produce only 
food for bees. Once the land was enriched by yearly 
rains, which were not lost, as they are now, by flowing 
from the bare land into the sea. The soil was deep, it 
absorbed and kept the water..., and the water that 
soaked into the hills fed springs and running streams 
everywhere. Now the abandoned shrines at spots 
where formerly there were springs attest that our 
description of the land is true. (Plato) 

Mooney and Ehrlich [40] trace modern concern for 
ecosystem services to George Perkins Marsh, a lawyer, 
politician, and scholar. Indeed, his 1864 book Man and 


Nature describes a wide array of services, again, often 
expressed in terms of their loss. Remarking on the 
terrain of the former Roman Empire, he notes that it 
“is either deserted by civilized man and surrendered to 
hopeless desolation, or at least greatly reduced in both 
productiveness and population” (p. 9). He continues, 
describing the reduction of hydrological services: “Vast 
forests have disappeared from mountain spurs and 
ridges, the vegetable earth ... [is] washed away; 
meadows, once fertilized by irrigation, are waste and 
unproductive, because ... the springs that fed them 
dried up; rivers famous in history and song have 
shrunk to humble brooklets” (p. 9). He also draws 
connections between deforestation and climate: “With 
the disappearance of the forest, all is changed. At one 
season, the Earth parts with its warmth by radiation to 
an open sky - receives, at another, an immoderate heat 
from the unobstructed rays of the sun. Hence the 
climate becomes excessive, and the soil is alternately 
parched by the fervors of summer, and seared by the 
rigors of winter. Bleak winds sweep unresisted over its 
surface, drift away the snow that sheltered it from the 
frost, and dry up its scanty moisture” (p. 186). Finally, 
he even wrote of decomposition services: “The carniv¬ 
orous, and often the herbivorous insects render an 
important service to man by consuming dead and 
decaying animal and vegetable matter, the decomposi¬ 
tion of which would otherwise fill the air with effluvia 
noxious to health” (p. 95). 

Other eloquent writers on the environment 
emerged following World War II, including Fairfield 
Osborn ( Our Plundered Planet , 1948), William Vogt 
(Road to Survival , 1948), and Aldo Leopold (A Sand 
County Almanac and Sketches from Here and There , 
1949). Each discusses ecosystem services without 
using the term explicitly. In The Population Bomb 
(1968), Paul Ehrlich describes anthropogenic disrup¬ 
tion of ecosystems and the societal consequences of 
doing so, addressing the need to maintain important 
aspects of ecosystem functioning. Along these lines, the 
Study of Critical Environmental Problems (1970) pre¬ 
sents a list of key “environmental services” that would 
decline with a decline in “ecosystem function.” This list 
was expanded upon by Holdren and Ehrlich [29]. 
Meanwhile, in the 1960s and 1970s, economists set out 
to measure “the value of services that natural areas 
provide” ([35], p. 12), with efforts focused on 
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agricultural production [3], renewable resources 
[11, 34], nonrenewable resources [18], and environ¬ 
mental amenities [23]. 

By the early 1980s, efforts were initiated to investigate 
two questions: the extent to which ecosystem function 
(and the delivery of services) depends on biodiversity, 
and the extent to which technological substitutes could 
replace ecosystem services. The first question is 
addressed in chapter ► Species Diversity Within and 
Among Ecosystems, this volume. The second question 
was tackled by Ehrlich and Mooney [21] . Work on these 
topics proliferated and, in 1997, a collective effort was 
made to synthesize the wealth of scientific information 
that had accumulated on the functioning of ecosystem 
services, with a preliminary exploration of their economic 
value, and of key issues meriting further work [14]. 



Economic & 
Cultural Models 


Ecosystem Services. Figure 3 

A framework showing key elements for integrating ES into 
decision-making [16]. One could link any two ovals, in 
any direction, in different decision contexts 


Recent Advances 

Four major advances of the last decade have revitalized 
research on ecosystem services and brought them into 
the public eye. First, the MA represented a visionary 
and seminal step in global science - it was the first 
comprehensive global assessment of the status and 
trends of all of the world’s major ecosystem services. 
It was requested by United Nations Secretary General 
Kofi Annan in 2000 and carried out between 2001 and 
2005 with contributions from over 1,360 experts 
worldwide. The key finding of this assessment was 
that two thirds of the world’s ecosystem services were 
declining [38]. This captured the attention of world 
leaders and emphasized the connections between 
human decisions and the natural environment that 
feed back to the human condition via changes in the 
flow of ecosystem services. 

Work following the MA clarified this chain of con¬ 
nections (Fig. 3) [16]. Human decisions shape individ¬ 
uals’ actions relating to the use of land, water, oceans, 
and other elements of natural capital. These actions 
often alter the state or functioning of ecosystems, 
which in turn provide altered flows of benefits (goods 
or services) to people. People express different values 
(monetary, cultural) associated with these altered 
streams of benefits and it is the expression of these 
values that leads to changes in institutions that guide 
decisions. The following three recent advances all con¬ 
cern the connections in this flow. 


A suite of recent advances has greatly improved 
understanding of the links between ecosystem func¬ 
tions and processes and the provision of ecosystem 
services (Fig. 3). For some ecosystem services, we now 
better understand the key ecological system compo¬ 
nents that drive provision (e.g., [33]) and we can now 
measure (e.g., [56]) and model, with uncertainty, the 
impacts of land use and resource management deci¬ 
sions on a wider variety of ecosystem processes and 
associated services. Ecological science has also 
advanced spatially explicit modeling, which is essential 
for mapping ecosystem services and their flows to peo¬ 
ple (e.g., [10, 27, 43, 57]). Finally, we are starting to see 
patterns in how multiple ecosystem services and biodi¬ 
versity change in relation to each other. Recent work 
has started characterizing bundles of ecosystem ser¬ 
vices, and exploring their synergies and trade-offs 
(e.g., [4, 6, 20, 42, 43]). 

Further, economic valuation methods have been 
applied to the spatial provision of ecosystem services 
to estimate the monetary value of benefits and, in some 
cases, the distribution of those benefits to various seg¬ 
ments of society [42, 45, 52, 64]. In addition, qualita¬ 
tive and quantitative methods from other social 
sciences have been applied to gain better understanding 
of the social and cultural importance of ecosystem 
services (e.g., [38]). 

Fastly, experiments in payments for ecosystem ser¬ 
vices [47, 48, 69], in ecosystem-based management [2], 
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and in regional planning have begun, giving us oppor¬ 
tunities to learn about how science can play a role in 
altering institutions, and how institutions alter deci¬ 
sions and the resulting flow of ecosystem services. The 
following section describes some of these efforts in 
more detail. 

Incorporating Ecosystem Services into Decisions 

Today, the urgent challenge is to move from theory to 
practical implementation of ES tools and approaches in 
resource decisions taken by individuals, communities, 
corporations, and governments. The framework in 
Fig. 3 connects the science of quantifying services with 
valuation and policy to devise payment schemes and 
management actions that take account of ecosystem 
services. This connection is expressed in the real world 
in a variety of ways across scales from local to global. 

A great number and diversity of efforts to imple¬ 
ment the ES framework have emerged worldwide over 
the past decade. Individually, most of these efforts are 
small and idiosyncratic. But collectively, they represent 
a powerful shift in the focus of conservation organiza¬ 
tions and governments (primarily) toward a more 
inclusive, integrated, and effective set of strategies 
[15]. Taken together, these efforts span the globe and 
target a full suite of ecosystem services, including prin¬ 
cipally forest-generated services of carbon sequestra¬ 
tion, water supply, flood control, biodiversity 
conservation, and enhancement of scenic beauty (and 
associated recreation/tourism values) [26, 62]. 

Many local or regional ES efforts focus on a single 
service that stands out as sufficiently important, from 
economic and political perspectives, to overcome the 
activation energy required to protect it. Under the 
institutional umbrella created for the focal service, it 
is possible that other services may be at least partially 
protected. Beginning in the late 1990s, larger-scale 
investment in natural capital for water flow regulation 
in China - and for a broad suite of ES in Costa Rica - 
set pioneering examples that are now being adapted 
elsewhere and scaled up. 

Next, we briefly describe some contrasting models 
of success, at different scales and in different kinds of 
social-ecological systems. In each case, there is an acute 
or looming crisis, innovative leadership, and pursuit of 
dual goals: improving both human and ES condition. 


Local Scale: Water Funds 

New York City made one of the first and most famous 
investments in ecosystem service provision in recent 
history, in the mid-1990s. The city invested ca. USD 1.5 
billion in a variety of watershed protection activities to 
improve drinking water quality for 10 million users 
rather than spending the estimated USD6-8 billion 
needed (excluding annual operating and maintenance 
costs) for building a new filtration plant. This seminal 
example is widely cited as evidence of the business case 
for investing in natural capital instead of built capital 
[ 15] . Yet the effort remains very much an experiment in 
the science and policy of investing in natural capital, 
and one on which there is international focus. 

Globally, watersheds are now emerging as the target 
of a range of creative policy and finance mechanisms 
that link beneficiaries to suppliers through a payment 
system. In these “water funds,” water users voluntarily 
pay into a pool that is collectively managed by contrib¬ 
utors and invested in watershed management improve¬ 
ments. The Nature Conservancy (TNC) has now 
established more than ten water funds in Latin 
America, has plans to create 22 more by 2015 [25], 
and is exploring the possibility of establishing some of 
the first funds in Africa. 

Agua por la Vida y la Sostenibilidad, one of the 
recently established water funds, demonstrates the 
diversity of water users that are becoming engaged in 
these funds and the kinds of watershed management 
changes these funds motivate. Formally established 
in the Cauca Valley, Colombia in 2009, this water 
fund is supported by the region s sugarcane grower’s 
association (PROCANA), the sugar producers’ associ¬ 
ation (ASOCANA), 11 local watershed management 
groups, TNC and a Colombian peace and justice 
nongovernment organization (Vallenpaz). Each mem¬ 
ber of the water fund voluntarily pays a self-determined 
amount into the fund that is then jointly managed by 
the members to improve landscape management in 
11 watersheds covering over 3,900 km 2 . 

Members in this fund have currently committed to 
contributing USD 10 million over 5 years to be invested 
in five kinds of management changes: protection of 
native vegetation, restoration of denuded lands, 
enrichment of degraded forests, fencing of rangelands, 
and implementation of silvopastoral practices. 
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The fund is starting a monitoring program that will 
ensure that these investments lead to measurable 
improvements in water quality for approximately one 
million water users downstream and significant 
improvements in terrestrial and freshwater biodiversity. 

Local Scale: Coastal and Marine Spatial Planning 

People commonly think of oceans as relatively feature¬ 
less expanses that defy the drawing of lines on maps. 
However, recent political and scientific advances have 
highlighted the need for a comprehensive approach to 
planning marine and coastal uses and the need for 
practical tools to make this more comprehensive 
approach a reality on the ground and in the water. In 
a marine spatial plan, a wide range of uses of the marine 
environment are put on one map. But an understand¬ 
ing of how such plans are likely to yield changes in the 
delivery of the broad range of services people receive 
from the system has, until recently, remained elusive. 

Along the west coast of Vancouver Island Canada, 
multiple, often competing interests are struggling to 
define the future character of the place. Existing extrac¬ 
tive, industrial, and commercial uses; traditional First 
Nations subsistence and ceremonial uses; recreation 
and tourism; and emerging ocean uses such as the 
extraction of wave energy are all in the mix. The West 
Coast Aquatic Management Board (WCA) is charged 
with creating a marine spatial plan for the region. WCA 
is a public-private partnership with participation from 
four levels of government (Federal, Provincial, local, 
and First Nations), and diverse stakeholders. Ulti¬ 
mately, WCA’s vision is to manage resources for the 
benefit of current and future generations of people and 
nonhuman species and communities. 

Some key pillars of the partnership’s strategy are to: 
use a precautionary, ecosystem-based approach to pro¬ 
tect, maintain, and restore marine and coastal 
resources; respect and protect First Nations’ food, 
social and ceremonial requirements and treaty obliga¬ 
tions; integrate expertise and knowledge from First 
Nations, local, scientific, and other sources; ensure 
broad participation in the planning process; and foster 
initiatives that maintain or enhance opportunities for 
coastal communities to benefit from local resources, 
while achieving sustainable social, cultural, and eco¬ 
nomic benefits for the region. WCA has partnered with 


the Natural Capital Project to explore how alternative 
spatial plans might affect a wide range of ES and to 
provide information about trade-offs among ES. 

Key considerations for WCA and their stakeholders 
include balancing important industrial and commer¬ 
cial activities (such as shipping, mining, logging, aqua¬ 
culture, and fisheries), increased development of 
tourism and recreation, renewable energy generation, 
and a strong cultural desire for sustaining the remote, 
wild feeling of the place. WCA is exploring the suitabil¬ 
ity of alternative regions for these different activities. 
For example, maps of coastal vulnerability to erosion 
and flooding from storm surge are helping to direct 
coastal development permits to low-risk areas. Similar 
maps of the value of captured wave energy are being 
overlaid with existing ocean uses (e.g., fishing and 
recreational activities) to highlight regions of high 
wave energy value, where wave energy generation facil¬ 
ities might be constructed while having minimal 
impacts on other activities. Examinations of trade¬ 
offs among aquaculture (finfish, shellfish), wild salmon 
fisheries, recreation (e.g., kayaking, whale watching, 
and diving), coastal development (on the coast, as 
well as floathomes), and habitat and water quality are 
underway. 

The general framework of ES and ES modeling, in 
particular, is helping to articulate connections between 
human activities that are often considered in isolation, 
to align diverse stakeholders around common goals, 
and to make implicit decisions explicit. ES modeling 
results have informed early iterations of the marine 
spatial plan and will inform the creation of the final 
plan in 2012. 

National Scale: Land-Use Planning and Human 
Development in China 

The ecosystem service investments being made in 
China today are impressive in their goals, scale, dura¬ 
tion, and innovation. Following massive droughts and 
flooding in 1997-1998, China implemented several 
national forestry and conservation initiatives, into 
which investments exceeded 700 billion yuan 
(ca. USD100 billion) over 2000-2010 [37, 70]. The 
larger and older of these initiatives are being rigorously 
evaluated to determine their biophysical and socioeco¬ 
nomic impacts, to improve their design and efficacy. 
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These initiatives have dual goals: to secure critical 
natural capital through targeted investments across 
landscapes and regions, and to alleviate poverty 
through targeted wealth transfers from coastal prov¬ 
inces to inland regions where many ES originate. The 
Chinese government aims to reduce the loss of soil, 
improve water retention, reduce desertification, and 
generally protect biodiversity and ecosystems in the 
west of the country for flood control, hydropower 
production efficiency, irrigation supply, more produc¬ 
tive agriculture, and ecotourism. In addition, it wants 
to change the economic structure in rural areas to 
increase local household income while simultaneously 
making local households’ patterns of land utilization 
and agricultural production more sustainable [36, 37]. 

The initiatives include two national PES programs, 
the Natural Forest Conservation Program (NFCP) and 
the Sloping Land Conversion Program (SLCP), 
established in 1998 and 1999 respectively. Implemen¬ 
tation was tested in a few provinces, and then rapidly 
scaled to the whole country. Evaluation of the pro¬ 
grams shows significant achievement of the biophysical 
goals, with remarkably rapid land conversion in the 
desired directions. For example, by the end of 2006, 
the SLCP had converted ca. 9 million hectares of 
cropland into forest/grassland and had afforested ca. 
12 million hectares of barren land. Village-level field 
measurements have shown not only that the payments 
for ES have altered land use patterns, but in turn 
soil erosion has been decreased in some areas by as 
much as 68% [9]. 

Overall social impacts of the programs are mixed. In 
some places, payment levels and types are leading to 
improvements in economic measures of well-being, 
whereas in others payments were not sufficient to com¬ 
pensate for loss of income from shifting livelihoods [37] . 
In addition, in some places where participation in the 
SLCP has significant positive impacts upon household 
income, it has not yet transferred labor toward non- 
farming activities as the government wished [36]. 
Payments are now being adjusted to improve success 
in achieving goals of poverty alleviation and growth of 
new economic sectors in rural areas. 

China is also now establishing a new network of 
Ecosystem Function Conservation Areas (EFCAs), spe¬ 
cifically for ES provision. Their exact delineation is now 
being determined through quantitative ecosystem 


service mapping and valuation. They are expected to 
span ca. 25% of the country. 

The current and potential future impacts of ES 
investments in China are enormous, certainly within 
the country - and also globally, in the form of enhanced 
carbon sequestration and reduced dust export, and 
perhaps most importantly in lessons on making the 
investments needed in natural capital and human 
well-being everywhere. 

International Scale: Global Policy and Research 
Efforts 

As described above, the MA was the first major effort to 
establish ES in the international policy arena. Activities 
stemming from that effort are now aimed at bringing 
countries together in making tangible commitments to 
safeguard ES (e.g., 2020 targets for the Convention on 
Biodiversity) and to assess national and international 
progress toward those commitments (e.g., through 
Group on Earth Observations Biodiversity Observa¬ 
tion Network (GEO BON) and the Programme on 
Ecosystem Change and Society (PECS), which synthe¬ 
size knowledge for the International Platform for Bio¬ 
diversity and Ecosystem Services (IPBES), formally 
established in 2010). Several new international research 
efforts aim to feed into these international processes, 
including the Natural Capital Project, The Resilience 
Alliance, and the Stockholm Resilience Centre. Other 
entities are focused on establishing and tracking ES 
markets, as a mechanism for bringing larger attention 
to ES benefits to society (e.g., The Katoomba Group 
and The Ecosystem Marketplace, both initiated by 
Forest Trends). As an example of many burgeoning 
international efforts, we describe in greater detail the 
Natural Capital Project. 

The Natural Capital Project (NatCap) (www. 
naturalcapitalproject.org) is an international partner¬ 
ship working to align economic forces with cons¬ 
ervation, by developing tools that make incorporating 
natural capital into decisions easy and replicable; by 
demonstrating the power of these tools in important, 
contrasting places; and by engaging leaders globally. 
NatCap is developing InVEST, a family of tools for Inte¬ 
grated Valuation of Ecosystem Services and Tradeoffs. 

InVEST helps decision makers visualize the impacts 
of potential policies by modeling and mapping the 
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delivery, distribution, and economic value of ES under 
alternative scenarios (for more information, see [30]). 
The outputs identify trade-offs and compatibilities 
between environmental, economic, and social benefits. 
InVEST is designed for use as part of an active decision¬ 
making process (Fig. 4) and can be applied at local, 
regional, or global scales. The first phase of the 
approach involves working with stakeholders to iden¬ 
tify critical management decisions and to develop sce¬ 
narios that project how the provision of services might 
change in response to those decisions as well as to 
changing climate, population, etc. Based on these sce¬ 
narios, a modular set of models quantifies and maps 
ES. The outputs of these models provide decision 
makers with information about costs, benefits, trade¬ 
offs, and synergies of alternative investments in ES 
provision. 

NatCap is using InVEST in major natural resource 
decisions in diverse contexts around the world, 
including in the three examples given above (water 
funds, coastal and marine spatial planning, and land- 


use planning and human development in China). 
The aim is to demonstrate the power of these 
approaches and to learn how to replicate and scale 
up models of success. The Project is engaged in a 
suite of international efforts, including GEO BON 
and IPBES, to offer a common, unifying platform for 
regional and national efforts that are spawned by these 
initiatives. 

Future Directions 

With the rapid rate of development of ES mapping, 
from the biophysical and economic modeling through 
to policy application in diverse socioeconomic con¬ 
texts, it is likely that great advances will be made in 
coming years. What we report here is only a beginning. 
There are key arenas in which further learning is crucial 
to understand what drives variation in the provision of 
ES, how they percolate through various arms of society, 
and how social reaction leads to sustainable or 
unsustainable change in ES provision. 
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An iterative process for integrating ecosystem services into decisions. The process begins with stakeholder engagement 
around impending decisions, with a focus on realistic, alternative scenarios for the future. The modeling is shaped by 
stakeholders, and typically focused on the services and scenarios deemed most important. Outputs are displayed in 
accordance with stakeholder preferences, in the form of maps, trade-off curves, and/or balance sheets. These can be 
expressed in biophysical (e.g., tons of carbon), economic (e.g., dollars), or cultural (e.g., visitor-days) terms 
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Relating Ecosystem Services and Human Health 

The relationships between biophysical attributes of 
ecosystems and human health are complex [41]. 
Destruction of natural ecosystems can at times improve 
aspects of public health. Draining swamps, for exam¬ 
ple, can reduce habitat for the mosquito vector that 
transmits the parasite that causes malaria. On the other 
hand, destruction of other systems can have sharp 
negative consequences for human health. There is 
emerging evidence that loss of tropical forests, for 
example, leads to an increase in transmission of malaria 
[1, 50]. Similarly, fragmentation of, and biodiversity 
loss from, eastern North American forests is associated 
with an increase in lyme disease [32]. 

Natural and managed ecosystems provide many 
services that sustain human health, through provision 
of human nutrition (especially of protein and 
micronutrients); purification and regulation of drink¬ 
ing water; regulation of air quality; regulation of 
vector-borne disease; and psychological benefits. 
There is a great need for research illuminating the 
links between biodiversity, ecosystem conditions and 
processes, and human health. 

Trade-offs and Synergies 

The relationship between ES and biodiversity and 
among different ES varies with socio-ecological context. 
In some cases, clear trade-offs and synergies among 
services have been defined in specific contexts [54], but 
there is still much to be learned about what determines 
the nature of these relationships. Advancing this knowl¬ 
edge is essential because policies addressing manage¬ 
ment change can only be successful if management 
controls ES relationships. If policies are established to 
align multiple ecosystem services, but biophysical con¬ 
ditions in the system lead to innate trade-offs among 
services, management changes are bound to fail in 
delivering the desired improvements to social benefits. 

Distributional Effects 

Much of the science of mapping ES has focused on 
identifying where ES are generated and where they are 
delivered. However, less work has focused on identify¬ 
ing to whom ES actually flow. This connection is essen¬ 
tial if policies addressing ES delivery are to be equitable 


and either improve the well-being of the poor or avoid 
unintended distributional consequences. Past work in 
this arena has focused on overlaying maps of ES provi¬ 
sion with an array of poverty indicators (e.g., [68]). 
Missing from this spatial analysis is information on 
access to and ability to control the delivery of ES. In 
many cases (e.g., for services such as clean drinking 
water, hydropower production, agriculture, water for 
irrigation, wave power generation), the actual delivery 
of services to specific people is affected by the location 
of infrastructure or institutions regulating access to 
resources. New science is needed that allows the ready 
mapping of these connections and the prediction of 
how they will change under future conditions. 

Dynamic Effects: Shocks and Uncertainty 

Dynamic changes, such as in climate and in the nitrogen 
cycle - as well as changes arising through economic 
development and evolving human preferences over 
time - are very important. The possibility of feedbacks 
within ecosystems, and between ES and human behavior, 
is a key area for further development. Feedback effects 
can give rise to thresholds and rapid changes in systems 
that can fundamentally alter system outcomes [60]. 
The ability to incorporate shocks and the possibility 
of surprises is another area where further development 
is needed. Fires, droughts, and disease all can have 
major influences on ecosystems and affect the services 
produced. Changes in economic conditions or fads in 
human behavior can similarly cause major changes in 
systems (e.g., financial crises). The occurrence of each 
of these and other potential disturbances is difficult to 
predict but virtually certain to come about. Under¬ 
standing their likely impacts on ecological and social 
systems will help us prepare for them. 

Valuation in Monetary and Non-monetary Terms for 
Decision-Making 

Monetary valuation of ES is not nearly as prevalent as 
sometimes assumed. More typically, real-world appli¬ 
cations of the ES framework rely on biophysical values 
to inform policy design, such as measures of water 
quality or flood risk. 

Value is not always easily characterized or fully 
captured in monetary terms, so it is important to 
characterize value in multiple dimensions, including 
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health, livelihood support, cultural significance, 
etc. (e.g., [17]). This will help ensure that valuation 
and broader decision-making approaches are inclusive 
of the range of benefits and people concerned [28]. 
Interdisciplinary efforts are presently underway to 
create a conceptual framework that is useful both in 
theory and in practice for a broad suite of cultural ES. 

Institutional Design 

However ES are measured, there is a need for political 
and social science research to design institutions and 
policy mechanisms that better capture externalities. 
Efforts such as national accounts are blossoming now, 
but it is unclear how they will evolve and how success¬ 
ful governments will be at incorporating natural capital 
into national measures of wealth. There is great work to 
be done in determining the merits and limitations of 
alternative policy and finance mechanisms, in different 
economic, governance, and other social contexts (e.g., 
[5, 46, 58, 59]). There is also great work to be done in 
developing institutions that achieve representation and 
participation by stakeholders as part of adaptive gov¬ 
ernance systems (e.g., [13, 55]). 

Conclusions 

Ecosystem services have had a relatively long history 
through indirect recognition of the importance of 
nature for the persistence of the human endeavor. 
There are scientific challenges for biogeochemists, 
hydrologists, ecologists, economists, anthropologists, 
and other social scientists to understand how human 
actions affect ecosystems, the provision of ES, and the 
value of those services. At least as demanding are the 
social and political questions associated with incorpo¬ 
rating this understanding into decision-making. There 
is also a need to design effective and enduring institu¬ 
tions to manage, monitor, and provide incentives that 
reflect the social values of ecosystem services. Informa¬ 
tion is becoming more readily available for individuals, 
corporate managers, and government officials who 
make decisions affecting ecosystems and the services 
to consider a more complete set of costs and benefits 
associated with their choices. We are likely to see con¬ 
tinuing growth in our scientific ability to measure and 
predict changes in ES, our ability to design policies 
and institutions that accurately represent these changes 


and in turn, the ability of the environment to continue 

providing the many benefits society needs to prosper. 
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Glossary 

Disturbance A spatial process or event that reverts 
forest vegetation to early successional stages typi¬ 
cally altering forest structure and composition. 
Ecotone A region of interface between two communi¬ 
ties, ecosystems, or biogeographic regions. 

Legacy A persisting spatial feature or pattern that was 
generated by a historical disturbance. Legacies can 
constrain the spatial dynamics of contemporary 
disturbances. 

Multi-scale analysis A method of spatial analysis that 
looks at the relative contributions of different scales 
of spatial pattern to a single observed spatial 
pattern. 

Pattern A repeatable and identifiable feature in a spa¬ 
tial context. 

Scale An attribute of a spatial process or data used to 
represent that process that describes its spatial 
dimensions. Scale includes elements of grain, 
extent, and thematic resolution. 

Spatial autocorrelation The degree of correlation of 
a variable and itself as a function of the spatial 
distances among sample points. 

Stationarity A feature of a spatial process in which the 
mean and variance of a process is consistent across 
the extent of a study area. 

Variography A geostatistical modeling tool for 
describing spatial variance and semivariance as 
a function of spatial distance among pairs of points. 

Definition of the Subject 

Ecological processes such as forest disturbances act on 
ecosystems at multiple spatial and temporal scales to 
generate complex spatial patterns. These patterns in 
turn influence ecosystem dynamics and have important 
consequences for ecosystem sustainability. Analysis of 
ecosystem spatial structure is a first step toward 
understanding these dynamics and the uncertain 
interactions among processes. There are many spatial 
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statistics available to describe and test spatial pattern 
within ecosystems and to infer the character of the 
processes that generated them. Indeed, improving 
understanding of the processes that create spatial pat¬ 
tern is a central objective of spatial pattern analysis. In 
addition to standard tests of spatial autocorrelation 
and patch structure, methods for multi-scale decom¬ 
position of spatial data and identification of 
stationarity are necessary to determine the key spatial 
scales at which the processes operate and affect ecosys¬ 
tems and to identify meaningful spatial subunits within 
larger contexts. Finally, tools for identifying ecosystem 
boundaries are also important to monitor boundary 
movement and changes in local ecosystem characteris¬ 
tics through time. 

Introduction 

Spatial Patterns in Ecosystems 

The spatial structure of ecological systems is important 
to examine and understand as spatial structure medi¬ 
ates the flows of individuals, materials, and informa¬ 
tion through space and time [1]. These flows bear on 
the probabilities of occurrence and persistence of floral 
and faunal populations which determine local and 
regional biological diversity as well as ecosystem 


functioning [2]. Interruptions and alterations of such 
flows within and among ecosystems in terms of rate, 
quantity, or both as a result of human interventions or 
natural dynamics such as disturbance can have impor¬ 
tant consequences for ecosystem sustainability and 
long-term population persistence. Quantitative char¬ 
acterization of spatial patterns and their rates of change 
in natural environments is essential to understanding 
ecological processes and to inform sustainable manage¬ 
ment techniques that aim to minimize degradation and 
alteration of ecosystem dynamics [3]. 

Spatial pattern, or simply spatial structure, refers to 
a quantifiable attribute of a spatial context. General 
definitions of the word pattern include a simple defi¬ 
nition such as a distinctive or regular “form” or “order,” 
or a feature that is repeated with some degree of “reg¬ 
ularity” [4]. Recently, Wagner and Fortin [5] defined 
the more general term “spatial heterogeneity” as spa¬ 
tially structured variability in a property of interest. 
Both exogenous environmental (e.g., edaphic variabil¬ 
ity, elevation, climate) and endogenous ecological (e.g., 
species interactions, pollen, and seed dispersal) pro¬ 
cesses generate spatial structure. Each of these two 
types of spatial processes can produce spatial pattern 
in multiple forms and scales (Fig. 1). The simplest form 
of spatial pattern is a simple gradient (Fig. la). Spatial 


E 


Gradient 


a 

Gradient + 
Patches 


b 

Gradient + 

Small 

Gradients 


c 



Large Gradient + 
Small Gradients + 
Linear Features 


d 

Large Gradient + 
Small Gradients + 
Linear Features + 
Strata 

e 

Large Gradient + 
Small Gradients + 
Linear Features + 
Strata + Species 

f 



Ecosystems and Spatial Patterns. Figure 1 

Spatial heterogeneity as a series of additive processes resulting in additive spatial patterns to which individual 
organisms (here represented points) may respond. Some or all of the different types of spatial heterogeneity may be 
present in any given landscape 
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structure can also be present in the form of patches, 
linear features, and points and can be superimposed on 
a gradient (Fig. lb-f). When biological spatial struc¬ 
ture is mostly responding to environmental conditions 
such as those depicted in Fig. 1, the resulting spatial 
structure is said to have spatial dependency to the 
environmental factors. However, when spatial struc¬ 
ture emerges as a result of interactions among ecolog¬ 
ical processes, the pattern is said to be spatially 
autocorrelated [5]. 

Spatial patterns and heterogeneity can also be 
defined using spatial and topological characteristics. 
These characteristics can include, but are not limited 
to, a patterns intensity, autocorrelation, degree of clus¬ 
tering, variability, and scale, which itself includes spatial 
grain and extent [6]. Importantly, a single pattern sum¬ 
marized using different characteristics can result in 
different interpretations of the processes behind that 
pattern [7, 8]. 

Because spatial pattern analysis is often interested 
in inferring the processes that created them, it is impor¬ 
tant to recognize that any single observed pattern rep¬ 
resents but one realization of the stochastic process (es) 
that generated it [7, 9]. By acknowledging that an 
observed pattern is but a single “snapshot,” its temporal 
dimension is recognized and that under different cir¬ 
cumstances, the patterns seen may not be exactly the 
same. Hence, a main objective of studying spatial pat¬ 
tern is to try to tease apart stochastic processes and the 
patterns they create from their spatiotemporal 
conditionalities. 

In addition to being driven by processes that are 
stochastic, patterns emerge as a result of multiple pro¬ 
cesses that operate at different spatial and temporal 
scales [10, 11]. These processes can be biotic or abiotic 
and are usually interconnected through dynamic, and 
occasionally nonlinear, feedback loops. For example, 
emergent spatial pattern following forest fires is condi¬ 
tional on the initial distribution of forest fuels as well as 
fire-weather conditions [12]. Similarly, patterns in for¬ 
est vegetation composition are often related to patterns 
in abiotic factors such as moisture, drainage, and soil 
conditions. Patterns in the genetic composition in ani¬ 
mal populations have also been shown to be influenced 
by the environmental variation (e.g., suitable vs. 
unsuitable habitat) between sampled populations 
[13]. These relationships are often nonlinear as 


the patterns that result from the interactions among 
pattern-generating processes tend to be different than 
any one process on its own [14, 15]. 

Novel spatial patterns created by contemporary 
anthropogenic processes have uncertain consequences 
for natural ecosystem dynamics. Anthropogenic pro¬ 
cesses including deforestation, development, land use 
change, and climate change do not replace natural 
processes, but have the capacity to interact with and 
alter them. As such, a significant question in modern 
ecology and ecosystem science is that of what are the 
effects of such novel patterns and processes on natural, 
or historical, system dynamics [16-18]. Not only do 
new sources of spatial variability influence natural 
dynamics through changing the patterns to which nat¬ 
ural processes respond, but they also can alter the 
processes themselves. For example, with regard to for¬ 
est fire dynamics, this is true where forest composition 
has been changed due to fire suppression and manage¬ 
ment (pattern change) and fire frequencies are 
increased due to increased ignitions near roads or 
changes in local weather patterns (process change). 
Similarly, with regard to animal population dynamics, 
movement and dispersal may be impeded through 
habitat loss and fragmentation (pattern change) and 
habitat loss can have an absolute effect on effective 
population size, rates of dispersal, and genetic variabil¬ 
ity (process change). Sophisticated spatial statistical 
analyses are required to begin to disentangle the con¬ 
tributions of different processes to observed spatial 
patterns to understand how best to manage natural 
systems to safeguard against further habitat-related 
losses to biodiversity [19]. 

Here the causes and consequences of spatial patterns 
in terrestrial forest ecosystems are reviewed with partic¬ 
ular emphasis on patterns of forest vegetation generated 
through landscape level disturbance processes. Spatial 
patterns in forest vegetation are both ecologically and 
economically important in that they are directly relevant 
to wildlife habitat supply, timber supply, future distur¬ 
bance dynamics, and represent future challenges to for¬ 
est and land managers. Uncertainty regarding future 
disturbance dynamics, in particular fire and insects out¬ 
breaks, in the context of global climate change makes 
investigations into disturbance interactions and poten¬ 
tial long-terms consequences for ecosystem spatial 
structure and functioning particularly relevant. 
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Forest Ecosystems 

In North American forests, disturbance processes gen¬ 
erally include landscape level fires, insect outbreaks, 
forest management (i.e., logging), and fine-scale local 
disturbances such as windthrow and fungal diseases. 
The patterns created through the interactions among 
disturbances can have important economic and ecolog¬ 
ical consequences. For example, Stadler et al. [20] dem¬ 
onstrated that hemlock wolly adelgid ( Adelges tsugae ) 
infestations in New England can affect both fast and 
slow ecosystem dynamics, nutrient cycling dynamics in 
the short term, and landscape-scale patterns of forest 
composition in the long term. Similarly, compounded 
disturbances (e.g., fire and logging) in the eastern 
boreal forest can result in alternate forest states [21], 
which can have consequences for biodiversity conser¬ 
vation. Economically, it has been clearly demonstrated 
that forests under risk of disturbance, either through 
fire or insect outbreaks, required longer rotation 
periods to accommodate for the losses [22]. 

Logging, fire, and insect outbreaks represent distur¬ 
bance processes that revert forest stands to early serai 
stages. Succession describes processes of forest recov¬ 
ery, regeneration, and change that vary in response to 
different disturbances. Although multiple processes 
generate forest spatial heterogeneity, not all influence 
it in the same way. Spatial disturbance legacies vary in 
terms of shape, size, intensity, boundary characteristics, 
influence on forest succession, and effects on forest age 
structure [18, 23-25]. The interactions among pro¬ 
cesses, or more properly, interactions among current 
disturbance and existing spatial legacies, create and 
maintain heterogeneous forest landscapes. This cas¬ 
cade of effects and constraints creates mutual dynamic 
feedbacks among patterns (spatial legacies) and spatial 
processes (disturbances) [26, 27] with important con¬ 
sequences for ecosystem dynamics. 

Different forest disturbances create different forms 
of spatial structure. Indeed, each disturbance imposes 
its own unique “spatial signature” on the landscape 
that also has different temporal characteristics contin¬ 
gent on a disturbance’s interaction with succession 
(Fig. 2). Fires, for example, tend to produce relatively 
discrete patches that occur over a short time frame and 
vary in terms of the residual forest structure that is left 
behind [28]. Logging is somewhat similar to fires in 


that the patches created are discrete and occur over 
short time frames and forest managers have control 
over the scale and amount of residual structure. Insect 
disturbances, such as outbreaks of spruce budworm 
(Choristoneura fumiferana ), forest tent caterpillar 
(Malacasoma disstri ), and the mountain pine beetle 
(Dendroctonus ponderosae ), are less discrete and tend 
to produce more complicated spatial structure and 
continue to affect forest structure at a given location 
for multiple consecutive years [29]. 

Each disturbance also has a unique relationship 
with forest regeneration processes [30] such that forest 
succession is tightly coupled to the type of disturbance 
that reinitiates stand development. These relationships 
determine future forest structure. Historically, fire and 
insects were the main disturbances in North American 
forest systems. Adaptations to disturbance such as 
serotiny in pine species (e.g., jack pine; Pinus 
banksiana ) and advanced regeneration in the under¬ 
story of spruce (e.g., Picea spp.) and balsam fir ( Abies 
balsamea) stands that maintain spruce budworm host 
availability over time [31] are evidence of this dynamic 
feedback between disturbance and succession. The spa¬ 
tial patterns created through forest management and 
their influences on forest succession in turn influence 
future forest disturbances dynamics [32]. Spatial pat¬ 
tern analysis is important to better understand the 
effects of human activities on natural disturbance 
dynamics. 

Sources of Heterogeneity 

Understanding the nature and consequences of spatial 
heterogeneity in ecosystems requires an understanding 
of the processes that generate this heterogeneity. In this 
section, different types of spatial heterogeneity and 
how different types of processes may give rise to com¬ 
plex spatial patterns are described. The consequences 
and potential challenges involved in indentifying the 
relative contributions of these different and frequently 
interacting processes are discussed next [6]. 

Levels of Organization 

Processes that generate spatial heterogeneity can be 
classified into a hierarchy of spatial processes that 
operate at different levels of ecological organization: 
(1) individual, (2) population, (3) community, and 
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Ecosystems and Spatial Patterns. Figure 2 

Spatial and temporal scales of forest disturbances. Interactions among disturbances are dependent on the unique 
successional responses to each disturbance (row 7). Historical forest systems were governed by interactions mainly 
between fire and insects ( arrows ) although presently, logging also interacts with these historical processes. Columns show 
the unique spatial and temporal attributes of each of the three main boreal forest disturbance agents: fire, insects 
(i.e., SBW), and harvesting (i.e., logging). These different spatial features result in different realization, or spatial signatures, 
of each disturbance {row 4). Interactions among these different processes produce a single observed spatial realization of 
spatial structure that contains elements of each of the different processes {row 5). Observed patterns contain elements 
of all three main processes; the objective of spatial analyses is to begin to tease apart the relative contributions of different 
processes to observed spatial pattern {row 6) 


(4) landscape/ecosystem. Individual processes include 
organism dispersal and habitat selection; population 
processes can include demographic dynamics as well as 
immigration/emigration; community level processes are 
highly relevant to natural disturbance dynamics and can 
include successional changes and rates of species turn¬ 
over. Examples of landscape/ecosystem level processes 
include disturbance, climate change, and migration. 

Processes within this organizational hierarchy are 
not necessarily independent and can influence each 
other among levels. Such interactions can influence 
emergent patterns due to potential cross-scale 


interactions and amplifications [33] and can also 
further complicate efforts to identify clear cause-and- 
effect relationships. A recent example of such cross¬ 
scale amplification in a forest ecosystem can be found 
within the mountain pine beetle (Dendroctonus 
ponderosae) system of Western Canada, where the 
recent outbreaks of the lodgepole pine infesting 
beetle have affected an unprecedented millions of hect¬ 
ares [34]. Here, the local dynamics of population 
control by host tree defenses were overcome when 
population numbers increased dramatically due to 
persistent warmer temperatures in the early 2000s. 
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The interactions between these local- and landscape- 
level processes are thought to have led to a positive 
feedback that allowed the outbreak to expand as 
much as it did [34]. 

Interactions Among Spatial Processes 

Spatial processes within ecosystems interact with each 
other directly as well as with the spatial legacies of 
previous, historical processes. In this way, spatial pat¬ 
terns and processes are connected through a dynamic 
and persistent feedback loop [32]. Spatial legacies can 
be thought of as a form of spatiotemporal connectivity 
among disjunct spatial processes or events that are 
mediated by forest succession and aging. The legacies 
of historical processes as represented by contemporary 
patterns can have long-lasting and significant impacts 
on biodiversity [ 16, 35] and efforts to sustainably man¬ 
age forest ecosystems [23, 36]. 

Spatial legacies can be defined at different scales and 
describe persisting features within a stand, landscape, 
or ecosystem. The term “legacy” can refer to fine-scale 
structural complexity following windthrow [37], land¬ 
scape level forest age structure [18, 38], disturbance- 
mediated seed availability [39], and residual forest 
structure following fire [39, 40]. Because patterns of 
historical land use can influence contemporary ecosys¬ 
tem composition, configuration, and ecosystem pro¬ 
cess dynamics long after the actual event [41], a better 
understanding of spatial legacies and their influence on 
ecosystem dynamics and landscape change over time is 
needed and requires novel spatial and temporal 
methods of investigation and analysis. 

From the perspective of sustainability, spatial dis¬ 
turbance legacies, including those created through 
human activities, represent future ecosystem patterns 
and future challenges for sustainable management. 
Gustafson et al. [42] showed that new forest harvest 
goals are not easy to achieve due to existing conditions 
when examining shifting forest management rules. 
Wallin et al. [38] demonstrated that shifts from 
a dispersed to an aggregated harvest pattern did not 
immediately result in a change in forest attributes such 
as patch size and edge density. Instead, new harvest 
rules had to work around the legacies of previous 
patterns, and original patterns were enforced. Similarly, 
Gustafson and Rasmussen [43] found that when vary¬ 
ing parameters in a harvest simulation model, the 


persistent legacies of previous harvest patterns resulted 
in timber harvest shortfalls. Using a simulation 
approach, James et al. [18] demonstrated that legacies 
in forest age structure created through forest manage¬ 
ment can persist for over 100 years. Ecologically, the 
consequences of these legacies interacting with new 
disturbances can result in greater system variability 
and gradual ecosystem degradation [43-45] or alterna¬ 
tive stable states [21, 46]. 

Multiple Spatial Scales 

Inferring the characteristics of spatial processes 
through analysis of spatial pattern is a central goal of 
most ecological studies as it is often very difficult to 
analyze the processes of interest directly. This can be 
particularly challenging when several types of pattern 
(Fig. 1) and underlying processes are present (Fig. 2). 
The challenge resides in the fact that the single observed 
pattern is an amalgamation of these multiple processes 
interacting with existing spatial structure and historical 
legacies; the functional relationships that connect these 
contributors to pattern are largely uncertain (Fig. 2). 
When analyzing the spatial structure of sampled data, it 
is not easy to disentangle the key spatial scales, and 
therefore processes, that act on the data. However, in 
the last decade, hierarchical decomposition methods 
(multi-scale ordination [47]; PCNM [48]; wavelets, 
[49] more detailed below) have been developed to 
identify the spatial scales at which data are most 
strongly structured and to decompose the data on the 
basis of scale-specific variances. Beyond simply 
describing patterns and the scales at which they are 
structured, it is also important to have a priori hypoth¬ 
eses about which scales and processes are the most 
relevant for the questions under study as these methods 
could reveal many patterns and spatial scales, many of 
which may not be of relevance [8]. Spatial pattern 
analysis will be more effective at describing underlying 
processes when used in an explicit and informed 
hypothesis testing framework. 

Ecological Consequences of Spatial Heterogeneity 

The consequences of changes in spatial pattern in forest 
landscapes are easily confounded by absolute losses in 
wildlife habitat [19]. That is, although both forest 
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composition and configuration are important, issues 
related to configuration are only relevant below 
a critical threshold of forest amount (usually 20-30% 
of area; [50]). Above the critical threshold, the land¬ 
scape generally remains “connected” and organisms or 
disturbances can spread in the landscape [51]. Below 
such a critical threshold, species’ response to the 
amount of habitat area is nonlinear as most species 
do not have enough habitat to meet their needs. Sur¬ 
rounding habitat quality (composition) and configu¬ 
ration become more important for local population 
persistence in this case. Moreover, fragmented land¬ 
scapes with various landcover types can impede species 
abilities to move from habitat patches to another [52]. 
For example, nesting birds do not cross forest gaps 
larger than 25 m [53]. Impediments to movement 
across landscapes can influence population dynamics 
[54] as well as genetic heterogeneity [55] , both of which 
affect the probability of population persistence. 

Spatial Analyses 

There are three main approaches to investigating the 
different aspects and consequences of spatial heteroge¬ 
neity. Spatial statistics, landscape metrics, and statisti¬ 
cal modeling, all approach the question of identifying 
spatial pattern in ecosystems in a slightly different way 
[5]. Owing to the varied history of approaches to 
studying spatial patterns including methods and con¬ 
cepts drawn from a diverse set of disciplines such as 
geography, geostatistics, and ecology, only the key con¬ 
cepts related to the most commonly used methods that 
quantify spatial structure within and among ecosys¬ 
tems are presented. Before describing the different 
types of analysis, some fundamental issues related to 
the spatial analysis of data are presented. 

Assumption of Stationarity 


when more than one process is present and that these 
multiple processes may be acting at different spatial or 
temporal scales. Yet, in most forest ecosystems, pro¬ 
cesses interact with one another, resulting in unique 
types and scales of spatial pattern which violate the 
assumption of stationarity. In such circumstances, it 
is required to first identify stationary subregions within 
such a larger spatial context. A few spatial analysis 
methods do not require the stationarity such as 
lacunarity analysis, local quadrat variance methods, 
and wavelets [7]. It is worth noting that these types of 
analyses although different and originating from dif¬ 
ferent developmental histories are quite similar to one 
another mathematically [56]. 


Data Type 

Spatial pattern within ecosystems can be represented 
using categorical or continuous data depending on the 
nature of the variable under investigation. Each type of 
data requires different methods of analysis (Fig. 3; [7]). 


Gradient Analysis 
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• Spatial Autocorrelation 

• Range 

• Anisotropy 
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• Composition 
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patch orientation, spatial 
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To infer spatial pattern from samples spatial statistics 
require that the area under study is governed by the 
same underlying process (i.e., the assumption of 
stationarity; [7]). As it is often impossible to be sure 
that the underlying process is stationary, one needs 
either to assume it or to determine whether or not the 
observed data are stationary (i.e., their statistical prop¬ 
erties such as mean, variance, isotropy do not vary with 
spatial distance). Nonstationary processes may arise 


Ecosystems and Spatial Patterns. Figure 3 

Spatial analysis can be undertaken on different types of 
spatial data, (a) Raster-based quantitative spatial data (e.g., 
forest height, basal area, NDVI) that can be analyzed using 
spatial statistics to determine the intensity, spatial range, 
and directionality (anisotropy) of the spatial pattern. 

(b) Categorical and qualitative forest data (e.g., species, 
stand age) require a different analytical approach that 
typically includes landscape pattern metrics 
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Categorical data can be described by the amount and 
configuration of the different discrete types on the 
landscape. Examples of categorical spatial data include 
forest type and age, or classified habitat patches. Both 
amount and configuration can be described in numer¬ 
ous ways using landscape pattern metrics [9, 57, 58]. 
Continuous data requires more subtlety in describing 
patterns and can include variables such as soil mois¬ 
ture, forest basal area, or remotely sensed reflectance 
indices (e.g., NDVI). Composition of continuous vari¬ 
ables can be described using the density distribution of 
the variable and configuration is usually described by 
a spatial covariance function that captures the strength, 
directionality, and scale of autocorrelation of the vari¬ 
able [5, 7, 59]. 

In addition to the categorical/quantitative dichot¬ 
omy of data types, patterns can be described using 
different geometric topologies of spatial features or 
units: the vector format (points, lines, polygons) and 
the raster (i.e., pixel) format [60]. Representation of 
spatial structure in one form of data does not preclude 
use of another form. For example, annual polygons of 
insect defoliation data can be converted into raster 
form and analyses can be undertaken using time series 
of raster values at a specific location [61] . Furthermore, 
binary rasters (presence/absence) can be converted to 
continuous rasters by increasing the cell size and 
counting the number of “presence” pixels surrounding 
a focal pixel using 4-neighbour, 8-neighbour, 16-neigh¬ 
bour rules and assigning this value to the to the new, 
larger pixel. 

Raster data can be used to represent any continuous 
variable. In contrast to point data, in raster data types, 
the information fully covers the extent of the study 
area. There is also a unique grain (or cell size) to each 
raster “pixel” that determines the subarea of continu¬ 
ous space that is discretized by the raster cell. The 
selection of raster grain can have important conse¬ 
quences to the results of spatial analyses [6]. Raster 
data can be used to represent any number of spatial 
variables relevant to disturbance ecology including, but 
not limited to, tree species [62], stand age [18, 63], 
basal area [64], insect damage [61], and number of 
fire occurrences [65]. The continuous coverage of ras¬ 
ter data makes it amenable to many different analytical 
techniques such local quadrat variance, lacunarity, and 
wavelets [56]. 


Spatial Analyses within an Ecosystem 

Ecological variables that are geographically distributed 
in space and time tend to be more similar when com¬ 
pared close together [66]. Autocorrelation is a feature 
of most data and can be quantified by the degree of self¬ 
similarity or dissimilarity in a variable between pairs of 
locations at a given distance apart (i.e., spatial lag 
determined in terms of equidistant classes). Note that 
spatial statistics on their own cannot differentiate 
between spatial dependence to environmental factors 
and spatial autocorrelation due to ecological processes; 
only prior knowledge and multiple testing can differ¬ 
entiate between these two sources of spatial structure. 

Spatial Description of the Pattern 

The objective of many spatial statistics is often to char¬ 
acterize to what degree spatial data are autocorrelated, 
if they are oriented in a particular direction (anisot¬ 
ropy), and at what scale. As these spatial statistics 
have been thoroughly reviewed elsewhere [7, 58], we 
focus on three topics (1) methods of spatial pattern 
analysis devoted to identifying structure in point data, 
(2) methods of spatial analysis that are devoted to 
identifying structure and pattern in two-dimensional 
raster (pixel) or polygon data, and (3) methods of 
spatial analysis explicitly concerned with identifying 
the scale, or scales of structure that are present in either 
point or two-dimensional data. Both the data types 
discussed can be examined in uni-, bi-, and multivar¬ 
iate contexts and can include either categorically or 
continuously measured variables. 

Point Pattern Analysis 

Spatial point processes describe phenomena that pro¬ 
duce events represented as points in space [67, 68]. The 
objective of point pattern analysis is to determine 
whether the distribution of events (points in space) is 
more or less spatially aggregated than is expected by 
chance and tests the null hypothesis of complete spatial 
randomness. The use of complete spatial randomness 
assumes that the underlying process is the same over 
the study area (i.e., stationarity). When it is not the 
case, the study area is said to “inhomogeneous” such 
that significance cannot be achieved using a single pro¬ 
cess such as complete spatial randomness. Modified 
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statistics and corrections have been developed to 
account for inhomogeneity within the study area 
[69, 70]. Point pattern analysis also assumes complete 
census of all point occurrences in the study area [7]. 
Fortin et al. [71] showed that the significance of spatial 
aggregation estimations is biased only when a subset of 
sample points is used rather than the entire set of 
points in the study area. 

The most commonly used method of point pattern 
analysis is Ripley’s K statistic [72]. New statistics have 
also been developed to compute local estimates [68]. 
Ripley’s K statistic, and derived statistics, can be 
applied in one-, two-, or three-dimensional space to 
compare the degree of aggregation of points [68]. In 
cases where two (or more) point processes are operat¬ 
ing, it may be of interest to assess whether one process 
influences the other and whether points of different 
types tend to cluster together. Examples of ecologically 
relevant spatial point processes include fire occurrence, 
plant occurrences [67], or the distribution of animal 
nesting or denning sites [72]. The bivariate, or cross -K, 
Ripley’s K test assesses whether the co-occurrence of 
two types of points is clustered together more or less 
than is expected by chance [58]. Using this technique, 
Lynch and Moorcroft [73] examined co-occurrence of 
fire and insect outbreaks and found that contrary to 
expectation, insect-caused forest mortality does not 
increase the risk of forest fire. 

Spatial Autocorrelation 

Often, a researcher is interested in determining the 
scale and strength of spatial autocorrelation of 
a variable as well as whether there is a directional 
trend in the data (i.e., anisotropy). This can be achieved 
using spatial autocorrelation coefficients such as 
Moran’s I which computes the product of the devia¬ 
tions of the values of the variable to its average 
according to various distance intervals (lags, classes) 
standardized by the variance at that spatial lag [7]. 
Moran’s I behaves like a Pearson’s correlation coeffi¬ 
cient such that the null hypothesis is the absence of 
spatial autocorrelation, positive autocorrelation 
(mostly a short distance) indicates that values have 
comparable values, while negative values indicate that 
the values are very dissimilar. Moran’s I assumes that 
the underlying process is the same over the entire study 


area (i.e., stationarity). Hence, the spatial autocorrela¬ 
tion coefficients computed at various distance classes 
are average values. Spatial autocorrelation in this sense 
can be referred to as a global spatial statistic that 
describes an attribute of the data over the entire study 
area [7]. Significance of each coefficient can be com¬ 
puted based on an asymptotic t or randomization 
procedure. In either case, stationarity is required. 
Moran’s I is very sensitive to skewed data as the mean 
will be biased and in consequence all the deviations 
values based on it will also be biased. It is therefore 
recommended to check the distribution of the data 
before computing spatial autocorrelation and if needed 
transform the data to obtain a symmetric distribution. 

Measures of spatial autocorrelation are also sensi¬ 
tive to sample size. When autocorrelation is estimated 
using too few locations, (e.g., <30 positions), spatial 
patterns may not be detected, even though present. 
Similarly, depending on the spacing among sampling 
locations, there will be different numbers of paired 
comparisons at each lag distance. Typically, there are 
few pairs at short distances due to the edge effects of the 
edge of the study area (no locations outside to compare 
too), most of the pairs at intermediate distances and 
very fewer at large distances (because of the overall size 
of the study area). To mitigate these unequal numbers 
of pair per distance lag, it is recommended to focus on 
lags distances equal to the length of the first half (up to 
two thirds) of the smallest edge of the study area as 
most spatial autocorrelation occurs at short distances 
and that the probability of detecting it is also highest in 
the first spatial lag [7]. 

A plot of spatial autocorrelation coefficients against 
distance lags is called a spatial correlogram. If the lags 
are based on distance only, the correlogram is said to be 
an omnidirectional correlogram. When the data con¬ 
tain directionality, the omnidirectional correlogram 
cannot reveal it and may, in fact, “mask” it. To detect 
the presence of anisotropic spatial pattern (i.e., not 
having the same sill and range according to direction), 
the samples need to be divided by distance class as well 
as direction angle range (usually 0°, 45°, 90°, 135°) to 
produce a set of directional correlograms. 

In areas where several processes influence ecological 
data, Moran’s I that assumes stationarity cannot be 
used. Instead, local indicator of spatial aggregation 
statistics, LISA (e.g., local Moran, local Getis), can be 
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used as they are computed at each sampling location 
and allow the identification of subareas that have sim¬ 
ilar high (“hot spots”) or low (“cold spots”) values [7]. 

Geostatistics 

Spatial structure can be determined in terms of spatial 
autocorrelation as presented above or as spatial vari¬ 
ance according to distance as computed using 
variograms which are part of the family of spatial 
statistical methods known as geostatistics [7, 58]. 
Variograms represent a global method of scale-specific 
analysis that has been used extensively in ecology to 
analyze spatial patterns [58]. Variograms model the 
relationship between lag distance and semivariance 
and can be calculated using continuous raster or 
point data. Semivariance is calculated as the sum of 
the squared differences between pairs of locations sep¬ 
arated by a given lag distance divided by twice the 
number of pairs of locations at that particular lag 
distance [7]. From the observed, or empirical, 
variogram, three parameters can be estimated to fit 
a theoretical variogram: (1) range, or scale at which 
distance does not affect the estimate of variance, (2) sill, 
or the variance of the data, and (3) nugget effect, which 
represents the variability in the data that is not 
accounted for by spatial structure [58]. Theoretical 
variogram models can identify whether there is 
a directional trend in the data, that is, anisotropy as 
the spatial autocorrelation values. In addition to 
describing the attributes of spatial structure, 
geostatistical models can be used to “krige” (i.e., spa¬ 
tially interpolate) data [58] and to simulate spatial 
patterns using a chosen variogram model, process 
model, and parameter estimates [74]. 

Spatial Scale and Scaling 

In addition to understanding the type (e.g., trend, 
patch) and strength (e.g., degree and distance) of 
a spatial pattern, it is useful to identify the spatial scale 
at which such patterns are present. Because patterns are 
the result of multiple processes that each have their own 
unique scales of spatial structure [75] , disentangling the 
relative contributions of these processes and assigning 
relative importance to them and the scales at which 
they operate are of fundamental importance to ecology 
and improving the understanding of complex systems 


[11, 16]. Furthermore, the identification of the relative 
contributions of different processes and scales to 
observed patterns is necessary for understanding 
cross-scale interactions [33] which is necessary to 
make reliable predictions of system dynamics, ostensi¬ 
bly the objective of any spatial analysis [76]. 

As stated above, spatial pattern describes a “quantifi¬ 
able attribute of a spatial context.” Scale-specific analysis 
identifies the spatial scale at which that attribute is 
structured establishes its specific context. With regard 
to forest disturbance dynamics, for example, individual 
forest stands may seem unstable through the processes of 
destruction and renewal through disturbances such as 
fire, but the larger forest landscape (i.e., collection of 
stands) is in fact stable with respect to the proportion 
and relative configuration of the different stand types. 
This is what is meant when disturbance-mediated forest 
systems are described as a shifting mosaic [77] . Different 
conclusions would be drawn about forest stability and 
resilience (sensu [78]) depending on the spatial or 
temporal scale of investigation. 

Scale generally describes the spatial extent, grain, 
and thematic resolution of a set of data [6]. However, 
scale can also be used to refer to a level within an 
organizational hierarchy to which such data pertain, 
such as a population, or community, or ecosystem [79] . 
It is important to note that scale in this latter sense is 
not directly equivalent to the former; scaling up, that is, 
increasing from local to a broader extent, or aggregat¬ 
ing data from a fine to a coarse scale may move the 
analysis into another level or an organizational hierar¬ 
chy, but not necessarily [80]. Although scale is best 
thought of as a feature of the phenomena of interest, 
it can also be a feature of sampling scheme imposed by 
a researcher, or the methods of analysis applied [6]. All 
of these features can influence a researcher’s ability to 
identify scales of structure in spatial data and to make 
meaningful inferences regarding the underlying pro¬ 
cesses. It is therefore important to distinguish structure 
that is emergent from the data from those related to 
sampling or analytic scales (i.e., arbitrary scales; [10]), 
as such a priori scales may have little to do with the 
actual scale of structure in the ecological phenomena of 
interest [76]. 

The ability to identify meaningful scales of spatial 
structure depends on the methods used and the type of 
data being analyzed [77, 81]. Methods differ in their 
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ability to identify local vs. global scales of pattern. 
Global methods of scale-specific analysis summarize 
spatial pattern at a single scale and generally assume 
that the underlying processes are stationary. Examples 
of global methods of analysis include variography [58], 
spectral (i.e., Fourier) analysis [82], and global mea¬ 
sures of spatial autocorrelation such as Moran s I and 
Geary’s c [7]. Multi-scale methods of analysis identify 
both global and local scales of structure, can assign 
relative importance to difference scales, and do not 
assume stationarity. Instead, such methods can be 
used to identify boundaries and scale-specific station¬ 
ary subregions within a larger spatial context [7, 74]. 
Multi-scale methods of analysis include lacunarity 
analysis [83], wavelets [74, 81, 84], distance-based 
Eigenvector methods (e.g., PCNM; [85]), and local 
spatial statistics [86]. 

Wavelet analysis is a particularly powerful method 
of local spatial analysis that can be used to decompose 
continuous data into its scale-specific components 
[87, 88]. A proportion of the total variance in the 
data set is associated with each level of the decomposi¬ 
tion through use of the wavelet variance [84] and the 


relative contributions of different scales to overall 
structure can be assessed and visualized as a scalogram 
(Fig. 4b). Each of these scales of pattern can then 
be isolated using a multi-resolution decomposition 
(Fig. 5; [88]). Under conditions where observed spatial 
pattern is assumed to be the result of multiple 
interacting processes, such data decomposition pro¬ 
vides an opportunity to assess the relationship 
among processes and individual scales of spatial struc¬ 
ture present in the data. In combination with the 
scalogram (Fig. 4b), the relative importance of these 
different scales can also be determined and further 
analyses can be restricted to only those spatial layers 
that correspond to the scales of interest. Isolated scales 
of spatial pattern can then be examined independently 
or used as scale-specific predictors in further statistical 
analyses [89]. 

Whereas Fourier analysis assumes that observed 
patterns can be described as a sum of sine waves of 
different frequencies, wavelet analysis identifies global 
and local structure at different scales using a local 
wavelet template that can take on a wide variety of 
shapes and forms [90]. The majority of wavelet 
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Ecosystems and Spatial Patterns. Figure 4 

Example of a hierarchical multi-scale decomposition of two-dimensional, quantitative data using wavelets, (a) Simulated 
spatial data. (Data were simulated using an exponential variogram model with the following parameters: Sill = 1; 

Range = 40; Nugget = 0.1). (b) Scalogram that summarizes the proportion of total variance in the original data 
associated with each scale of the decomposition. Wavelet decomposition was accomplished using a maximal-overlap 
discrete wavelet transform (MODWT; Percival and Walden [109]) 
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Ecosystems and Spatial Patterns. Figure 5 

Back-transformed images from the decomposition of the data in Fig. 4a. Each panel shows the spatial structure at an 
individual isolated scale 
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applications have been in the analysis of temporal sig¬ 
nals to identify periodicity in things such as climatic 
variability [91] and epidemiological time series 
[92, 93]. However, spatial applications of wavelet anal¬ 
ysis in ecology continue to be developed and have 
been used to investigate one-dimensional forest canopy 
gap structure [87], vegetation reflectance [89], two- 
dimensional structure in grassland productivity [81], 


tree crown identification [94], and the significance of 
spatial structure in forest basal area [74]. 

When data are not sampled in a continuous way (i.e., 
they are irregularly spaced), multi-scale decompositions 
can be performed using spatial eigenfunction analyses 
such as principal coordinate analysis of neighbor matri¬ 
ces (PCNM) and Morans eigenvector maps (MEM) [48, 
95, 96]. These methods model spatial structure in 
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a multivariate framework using distance matrices 
where the spatial coordinates of the sampled sites are 
converted into a set of synthetic spatial variables that 
represent spatial structure at different spatial scales. 
These synthetic variables can then be used in combina¬ 
tion with constrained ordination methods (e.g., canon¬ 
ical correspondence analysis and redundancy analysis; 
[95]) to identify the spatial scales at which data are 
dominantly structured. Jombert et al. [97] also pro¬ 
posed a multi-scale pattern analysis (MSPA) to deter¬ 
mine which of these many scales are the most relevant 
to use as spatial predictors in subsequent analyses such 
as partial ordination or multiple regression [85]. 

Spatial Analyses Among Ecosystems 

The ecotonal interfaces between ecosystems are impor¬ 
tant to delineate as they are the locations where the 
exchange of nutrients and species turnover occurs [7]. 
It is also important to determine not only the boundary 
location between ecosystems, that is, where one system 
begins and another ends, but also its width (i.e., sharp/ 
line or gradual/zone) [98]. There are two different 
types of methods that can be used to determine the 
interface between ecosystems: by creating spatial clus¬ 
ters or by detecting boundaries [99]. In either case, the 
sampling design used to collect the data is crucial: The 
determination of a boundary (i.e., area of high rate of 
changes in the values of variables, hence a heteroge¬ 
neous area) is relative to the two adjacent spatially 
homogeneous ecosystems. Therefore, the sampled 
data should cover enough of both ecosystems such 
that their interface can be detected. 

Spatial Clustering 

To delimit boundaries between ecosystems, spatially 
homogeneous clusters can be determined based on the 
degree of similarity of sample attributes and their spatial 
adjacencies [66,99,100] . The degree of similarity can be 
based on commonly used clustering algorithms 
(agglomerative, k- means, fuzzy logic, etc.) and adja¬ 
cency can be based on network connectivity algorithms 
(e.g., nearest neighbors, minimum spanning tree, 
Gabriel network, Delaunay network; [7]). Spatial clus¬ 
tering provides the membership of each location to 
a spatial cluster and therefore, as a by-product, identifies 
boundaries among clusters. However, clustering 


procedures do not provide any information about the 
location and width of the identified boundaries between 
clusters. When information about location and width 
are needed, other methods should be used such as 
boundary detection methods ([99]; but see [101]). 

Spatial Boundary Analyses 

Ecological boundaries can be defined as areas of high 
rates of change or large absolute differences between 
adjacent locations [102]. Boundary detection methods 
[98] include edge detection algorithms (Laplacian, 
Canny, Sobel, Monomier, etc.), wavelet analysis 
[74, 81], and wombling (lattice, triangulation, categor¬ 
ical). The two former families of methods require the 
data to be in a contiguous fashion (grid) without any 
missing values while the latter one can either be used 
with contiguous or irregularly sampled data. All 
methods compute the magnitude of rate of change 
between adjacent locations over the entire extent of 
the study area. To determine which rates of changes 
are significantly higher than the others, different statis¬ 
tics and significance tests having been developed. 
Wombling typically uses arbitrary percentile thresholds 
of boundary elements to identify significant bound¬ 
aries [7] . James et al. [74] proposed a series of restricted 
randomization procedures to test the significance of 
wavelet boundaries using variogram-based spatial null 
models. Oden et al. [103] developed boundary statistics 
to test the cohesiveness properties of the boundaries. 
Once cohesive boundaries have been detected and 
tested, subsequent hypothesis testing can be performed 
by comparing the spatial overlap and movement of 
boundaries using spatial overlap statistics [7, 104, 105] 
or polygon change analysis [106]. 

Future Directions 

The detection and characterization of spatial structure is 
necessary for developing the understanding of ecosystem 
function and for ensuring sustainable management and 
use of landscape resources. Indeed, the current scale and 
pace of anthropogenic influence on the natural environ¬ 
ment is without precedent and the ways in which novel 
human-created spatial patterns interact with and influ¬ 
ence natural processes are uncertain. Even more uncer¬ 
tain are the relationships among different scales of spatial 
and temporal pattern and what such cross-scale 
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interactions may mean to ecosystem dynamics [107]. 
Future directions in the analysis of spatial pattern will 
require approaches and methods that can begin to tease 
apart the separate scales, both spatial and temporal, of 
processes that contribute to spatial patterns both 
within and among ecosystems. These methods should 
include multi-scale approaches such as the wavelet- 
based methods described above, tools to assess statisti¬ 
cally significant changes over time, and methods that 
can identify local spatially significant subregions within 
larger spatial contexts [74]. Meaningful inference in 
these regards will only be possible if the spatial pattern 
analysis is applied within a hypothesis testing frame¬ 
work, where competing notions of how individual pro¬ 
cesses percolate through the landscape to produce 
pattern can be tested statistically [8]. Finally, increasing 
availability of remotely sensed data (e.g., NDVI, 
LiDAR, Quickbird, LANDSAT, MODIS) will allow 
people to detect spatial patterns at finer spatial and 
temporal scales over much larger spatial extents than 
has been previously possible. These huge amounts of 
data will also require focused, hypothesis-driven ques¬ 
tions, the use of data-mining tools [108], and the 
further development of spatiotemporal statistics. 
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Glossary 

Adaptive governance Institutional and political 
frameworks designed to adapt to changing relation¬ 
ships between society and ecosystems, institutional 
frameworks that enable adaptive management, and 
the facilitation of learning from adaptive manage¬ 
ment to policy. 


Adaptive management A systematic process of natu¬ 
ral resource management whereby management 
actions are treated as experiments to increase learn¬ 
ing and improve subsequent management. 

Natural resource management The management of 
natural resources including land, water, plants, 
and animals to meet societal goals, including 
conservation and exploitation. 

Resilience The capacity of a system to absorb distur¬ 
bance without altering states (undergoing a regime 
shift); a measure of the amount of disturbance 
a system can tolerate before collapsing. 

Structured decision making A general term for 
a framework of analysis of problems to reach deci¬ 
sions based on evidence to meet stated goals. 

Definition of Adaptive Management 

Adaptive management is an approach to natural 
resource management that emphasizes learning 
through management based upon the philosophy that 
knowledge is incomplete and much of what is thought 
to be known is actually wrong, but despite uncertainty, 
managers and policymakers must act [ 1 ]. Although the 
concept of adaptive management has resonated with 
resource management scientists and practitioners 
following its formal introduction in 1978 [2], it has 
and continues to remain little practiced and much 
misunderstood. Misunderstanding is largely based 
upon the belief that adaptive management is what 
management has always been, a trial and error attempt 
to improve management outcomes. But unlike a trial 
and error approach, adaptive management has explicit 
structure, including a careful elucidation of goals, iden¬ 
tification of alternative management objectives and 
hypotheses of causation, and procedures for the collec¬ 
tion of data followed by evaluation and reiteration. 
Since its initial introduction and description, adaptive 
management has been hailed as a solution to endless 
trial and error approaches to complex natural resource 
management challenges and recently, it has become 
increasingly referenced under various forms (please 
refer to following sections) (Fig. 1). Regardless of the 
particular definition of adaptive management used, 
and there are many, adaptive management emphasizes 
learning and subsequent adaptation of management 
based upon that learning. The process is iterative, and 
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Generalization of the different approaches to natural resource management 


serves to reduce uncertainty, build knowledge, and 
improve management over time in a goal-oriented 
and structured process. However, adaptive manage¬ 
ment is not a panacea for the navigation of “wicked 
problems” [3, 4] as it does not produce easy answers, 
and is appropriate in only a subset of natural resource 
management problems where both uncertainty and 


controllability are high (Fig. 2) [5]. Where uncertainty 
is high but controllability is low, scenarios are a more 
appropriate approach. Adaptive management is a poor 
fit for solving problems of intricate complexity, high 
external influences, long time spans, high structural 
uncertainty, and with low confidence in assessments 
[5] (e.g., climate change). However, even in such 
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Adaptive management and scenarios are complementary 
approaches to understanding complex systems. Adaptive 
management functions best when both uncertainty and 
controllability are high, which means the potential for 
learning is high, and the system can be manipulated 
(Adapted from [60]) 


situations, adaptive management may be the preferred 
alternative, and can be utilized to resolve or reduce 
structural uncertainty. 

Clearly, adaptive management has matured, but it 
has also reached a crossroads. Its application is now 
common to a variety of complex resource management 
issues, and while practitioners and scientists have 
developed adaptive management and structured deci¬ 
sion-making techniques, and mathematicians have 
developed approaches to reducing the uncertainties 
encountered in resource management, there continues 
to be misapplication of the method, and misunder¬ 
standing of its purpose. 

Introduction 

Adaptive management of natural resources did not 
spontaneously appear, but represents an evolving 
approach to natural resource management in particu¬ 
lar, and structured decision making in general. 
Founded in the decision approaches of other fields 
[6] including business [7], experimental science [8], 
systems theory [9], and industrial ecology [10], the 
first reference to adaptive management philosophies 
in natural resource management may be traced back 
to Beverton and Holt [11] in fisheries management, 


though the term “adaptive management” was yet to be 
used (reviewed in [6]). The term “adaptive manage¬ 
ment” would not become a common vernacular until 
C.S. Holling, widely recognized as the “father” of adap¬ 
tive management, produced his edited volume on the 
subject “Adaptive Environmental Assessment and 
Management” in 1978 [2]. The work was spawned by 
the experiences of Holling and colleagues at the 
University of British Columbia following from the 
development of resilience theory [12]. The concept of 
resilience, predicated upon the existence of more than 
one alternative stable state for ecosystems, had several 
ramifications. For one, it meant that managers should 
be very careful not to exceed a threshold that might 
change the state of the system being managed, and the 
location of those thresholds is unknown. Second, for 
ecological systems in a favorable state, management 
should focus on maintaining that state, and its resil¬ 
ience. Adaptive management, then, was a method to 
probe the dynamics and resilience of systems while 
continuing with “management,” whereby management 
experiments were developed to enhance learning and 
reduce uncertainty, in a fail-safe manner. According to 
H oiling ( http: / / www. res alliance. o r g/2561. p hp ): 

► The resilience research led us to mobilize a series of 
studies of large scale ecosystems subject to manage¬ 
ment- terrestrial, fresh water and marine. All this was 
done with the key scientists and, in some cases, policy 
people who "owned" the systems and the data. So the 
process encouraged two major advances. One advance 
developed a sequence of workshop techniques so that 
we could work with experts to develop alternative 
explanatory models and suggestive policies. We 
learned an immense amount from the first experiment. 
That focused on the beautiful Gulf Islands, an archipel¬ 
ago off the coast of Vancouver. We chose to develop 
a recreational land simulation of recreational property. 

I knew little about speculation, but we made up 
a marvelous scheme that used the predation equations 
as the foundation- the land of various classes were the 
"prey," speculators were the "predators" and a highest 
bidder auction cleared the market each year. The equa¬ 
tions were modifications of the general predation 
equations. The predictions were astonishingly effective 
and persisted so for at least a decade. As much as 
anything, it reinforced the earlier conclusion that 





Ecosystems, Adaptive Management 


3345 



these equations were powerful and general. But the 
important conclusion concerned the workshop 
process and the people. 

Eventually Carl Walters [1] built upon Holling’s 
foundational contribution [12] and further developed 
the ideas, especially in the realm of mathematical 
modeling. Whereas Holling’s original emphasis was in 
bridging the gap between science and practice, Walters 
emphasized treating management activities as designed 
experiments meant to reduce uncertainty. Both scien¬ 
tists sought an approach that allowed resource man¬ 
agement and exploitation to continue while explicitly 
embracing uncertainties and seeking to reduce them 
through management. Walters [1] described the pro¬ 
cess of adaptive management as beginning “with the 
central tenet that management involves a continual 
learning process that cannot conveniently be separated 
into functions like research and ongoing regulatory 
activities, and probably never converges to a state of 
blissful equilibrium involving full knowledge and opti¬ 
mum productivity.” He characterized adaptive man¬ 
agement as the process of defining and bounding the 
management problem, identifying and representing 
what is known through models of dynamics that iden¬ 
tify assumptions and predictions so experience can 
further learning, identifying possible sources of uncer¬ 
tainty and identifying alternate hypotheses, and finally 
the design of policies to allow continued resource man¬ 
agement or production while enhancing learning. 

A key focus of adaptive management is the identi¬ 
fication and reduction, where possible, of uncertainty. 
Uncertainty is reduced through management experi¬ 
ments which enhance learning. Williams [6] describes 
four critical sources of uncertainty: 

1. Environmental variation is often the most common 
source of uncertainty, and is largely uncontrollable. 
It may have a dominating influence on natural 
resource systems, through such factors as random 
variability in climate. 

2. Partial observability refers to uncertainty about 
resource status. An example of this is the sampling 
variation that arises in resource monitoring. 

3. Partial controllability arises when indirect means 
(e.g., regulations) are used to implement an action 
(e.g., setting a harvest rate), and it can lead to the 
misrepresentation of management interventions 


and thus to an inadequate accounting of their influ¬ 
ence on resource behavior. 

4. Structural or process uncertainty arises from a lack of 
understanding or agreement regarding the struc¬ 
ture of biological and ecological relationships that 
drive resource dynamics. 

Adaptive Management Today 

Adaptive management has been referenced either 
implicitly [11] or explicitly [2, 13] for more than 50 
years, but despite an illustrious theoretical history, 
there has remained imperfect realization of adaptive 
management in real world natural resource manage¬ 
ment decisions. The limited implementation of adap¬ 
tive management stems from three fundamental 
problems: (1) a lack of clarity in definition and 
approach, (2) a paucity of success stories upon which 
to build [14-18], and (3) management, policy, and 
funding paradigms that favor reactive rather than pro¬ 
active approaches to natural resource management 
[19, 20]. Each of these challenges has slowed the 
development of adaptive management as a paradigm 
for natural resource management and resulted in 
incomplete, inefficient, and even inappropriate 
implementation of adaptive management. 

Although semantic arguments may seem the realm 
of ivory-towered professors, inconsistent and even con¬ 
tradictory approaches and definitions of adaptive man¬ 
agement have resulted in confusion and limited the 
ability of management organizations to develop consis¬ 
tent and repeatable comprehensive adaptive manage¬ 
ment programs. Ironically, the confusion over the term 
“adaptive management” may stem from the flexibility 
inherent in the approach which has resulted in multiple 
interpretations of “adaptive management” that fall 
upon a continuum of complexity and a priori design, 
starting from the simple (e.g., “learning by doing”) and 
progressing to the more explicit (e.g., “a rigorous 
process that should include sound planning and exper¬ 
imental design with a systematic evaluation process 
that links monitoring to management”) [2, 21, 22]. 
Obviously, there is a clear distinction in intent, 
investment, and success between approaches that pro¬ 
pose to learn from prior management decisions and 
those that outline a concise feedback mechanism 
dependent upon sound scientific principles on which 
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future management decisions will be made. The defi¬ 
nition of “adaptive management” is further confused 
because one of the powerful attributes of adaptive 
management is the ability to simultaneously address 
multiple needs of managers, scientists, and stake¬ 
holders. The result has been published reports of adap¬ 
tive management that emphasize definitions that focus 
on the needs of the authors and the ability of adaptive 
management to meet those needs (e.g., experimenta¬ 
tion [14], uncertainty [23], changing management 
actions [24], monitoring [25], and stakeholder involve¬ 
ment [26]). 

Despite the challenges in defining adaptive man¬ 
agement, momentum and interest in the subject and its 
application continue to grow. The recent development 
by the United States Department of Interior of an 
adaptive management technical guide (http://www. 
doi.gov/initiatives/AdaptiveManagement/TechGuide. 
pdf) and the policies developed around this manual to: 

► Incorporate adaptive management principles, as 
appropriate, into policies, plans, guidance, agree¬ 
ments, and other instruments for the management of 
resources under the Department's jurisdiction. - 
Department of Interior Manual (522 DM 1) 

are an indication of the growing movement in natural 
resource management toward taking a more proactive 
role in management decisions. Unfortunately, this 
movement has little to build upon with one clear 
exception, the U.S. Fish and Wildlife Service 
(USFWS) Adaptive Harvest Management Plan 
(AHM) for mid-continent mallards. Worldwide, 
AHM is one of the few successful efforts to apply the 
principles of adaptive management and demonstrate 
how to successfully manage natural resources by 
improving the understanding of natural systems 
through management actions. The adaptive manage¬ 
ment processes of AHM have greatly improved the 
understanding of the harvest potential of waterfowl 
populations, the ability of managers to regulate harvest, 
and the importance of monitoring and assessment 
programs to support the decision-making process. 

So why has AHM succeeded while so many other 
attempts to implement adaptive management have 
stalled? First, AHM developed a clear and concise 
objective: maximize long-term waterfowl harvest 
while ensuring long-term viability of waterfowl 


populations. The development and agreement by 
stakeholders to a concise set of fundamental objectives 
is paramount to ensuring the success of any adaptive 
management program. Failure to agree upon funda¬ 
mental objectives and unwarranted attempts to alter 
objectives will ensure any attempt to manage, whether 
adaptive or not, will fail. The second key to the AHM 
success was due to simultaneous support for manage¬ 
ment, research, and monitoring. Waterfowl research 
and management in North America are nearly 
unequaled by almost any natural resource management 
program in terms of history, scope, and investment 
[27]. The enormity of historical and current data and 
the availability of resources for researchers and man¬ 
agers to utilize that data have facilitated the develop¬ 
ment of innumerable research and management 
activities all of which have fed back into the AHM 
process. In addition, the AHM program has arguably 
one of the most comprehensive monitoring programs 
for any ecological system currently under study. The 
combination of well-supported management, research, 
and monitoring programs has resulted in a clear reduc¬ 
tion in the uncertainty of how waterfowl populations 
respond to management and enabled managers and 
policy makers to more effectively meet their stated 
objectives. Unfortunately, too often, attempts to imple¬ 
ment adaptive management fail to address all of the 
requirements. In particular, resources for monitoring 
and research are often undervalued with the resultant 
outcome being a series of management actions with no 
understanding of their implications. 

The final key to the success of AHM has been the 
ability to implement management and policy deci¬ 
sions based on the best information available. In 
many historical and current attempts to implement 
adaptive management, the regulatory body charged 
with implementation of management recommenda¬ 
tions either is unable, or worse, is unwilling to 
implement actions proposed by the outcome of the 
adaptive management process. The body in charge of 
regulatory control is too often a stakeholder in the 
process of adaptive management with an agenda inde¬ 
pendent of regulating the resource alone. There may 
even be, and often are, several regulatory agencies 
controlling resources, each an independent stake¬ 
holder, each with an independent agenda. Such a sit¬ 
uation can make implementation of a management 
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recommendation challenging, especially if it contra¬ 
dicts long-standing dogma. Consider for example, the 
management of Glen Canyon Dam and the waters of 
the Colorado River. Heralded by Congress as an adap¬ 
tive management success story, the Colorado River 
Adaptive Management Program has fallen short of 
success because despite 13 years of work, the ecologi¬ 
cal status of the Colorado River and the conflict inher¬ 
ent to the development of an adaptive management 
program continue to worsen [28]. This is because the 
regulatory agency that controls the flow of water 
throughout the Colorado River Basin, the Bureau of 
Reclamation, is also one of the major stakeholders in 
the adaptive management process with an agenda 
(water storage) that conflicts with several other stake¬ 
holders and regulatory agencies that manage people 
and wildlife along the Colorado River (e.g., California 
Department of Water Resources, Mexican National 
Water Commission, USFWS). In contrast to the man¬ 
agement of the Colorado River, there is a single cen¬ 
tralized regulatory body governing waterfowl harvest in 
the United States (USFWS), and although there are 
many stakeholders that play a role in setting harvest 
management regulations, ultimately, decisions are 
made by the USFWS. Equally important, the interests 
of the USFWS parallel those of the other stakeholders. 
For the Colorado River, stakeholder interests are 
almost directly at odds. So from these examples is one 
to conclude that adaptive management is an 
unattainable mandate for the management of resources 
where various stakeholders and regulators are at odds? 
No, implementation of adaptive management is appro¬ 
priate in both examples, possibly even more so for the 
management of the Colorado River. What the 
Colorado River example highlights is the importance 
of collaboration, the benefits of a single or 
superregulatory body, and the need to agree upon 
a priori objectives that guide long-term management 
decisions despite short-term political, societal, eco¬ 
nomic, or even environmental impacts. 

Structured Decision Making 

A key component of any management approach, 
whether it is adaptive or not, is deciding on the objec¬ 
tives, goals, and ultimately management options that 
may best achieve the desired goals (Fig. 3). 


Unfortunately, as with many decisions, deciding 
upon a proper set of objectives and the means to 
reach those objectives can prove challenging. Resource 
management decisions are further complicated 
because social-ecological systems are complex (e.g., 
multiple objectives and stakeholders, overlapping 
jurisdictions, short- and long-term effects) and are 
characterized by a high degree of uncertainty (e.g., 
appropriate management action or monitoring pro¬ 
tocols, future economic or ecological conditions) and 
therefore present decision makers with challenging 
judgments (e.g., predicted consequences of proposed 
alternatives, value-based judgments about priorities, 
preferences, and risk tolerances) often under enor¬ 
mous pressure (economic, environmental, social, 
and political) and with limited resources to ensure 
success. The resulting outcome of such conditions 
too often leads to management paralysis, or continu¬ 
ation of the status quo, as managers and policy makers 
become overwhelmed by the process of the decision 
and lose track of the desired social-ecological condi¬ 
tions they are charged with achieving. Indeed, the 
process of resource management can be arduous and 
even controversial, particularly if there are a variety of 
stakeholders vying to push the agenda. Fortunately, 
there are methods to overcome these pitfalls and max¬ 
imize the potential for success. 

One method to overcome management paralysis 
and mediate multiple stakeholder interests is struc¬ 
tured decision making. Borrowed from the sociological 
fields, structured decision making is an organized 
approach to identify and evaluate alternative resource 
management options by engaging stakeholders, 
experts, and decision makers in the decision process 
and addressing the complexity and uncertainty inher¬ 
ent in resource management in a proactive and trans¬ 
parent manner. Structured decision making uses 
a simple set of steps (Fig. 3) to evaluate a problem 
and integrate planning, analysis, and management 
into a transparent process that provides a roadmap 
focused on achieving the fundamental objectives of 
the program. It differs somewhat from “active” adap¬ 
tive management in that it does not emphasize repli¬ 
cated management experiments (Fig. 4). Central to the 
success of the structured decision making process is the 
requirement to clearly articulate fundamental objec¬ 
tives, explicitly acknowledge uncertainty, and respond 
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The minimum steps necessary to implement a structured decision-making process: More complex integration of 
individual steps may be necessary if future steps clarify the process or if the decision is iterative over time 
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Structured decision making and adaptive management 
differ somewhat, especially in that active adaptive 
management emphasizes the utilization of multiple 
replicated management experiments. As such, learning 
may be faster when such experiments are possible. 
However, adaptive management and structured decision 
making are terms often used interchangeably 


transparently to all stakeholders’ interests in the deci¬ 
sion process. The conceptual simplicity inherent in 
structured decision making makes the process useful 
for all decisions from minor decisions to complex 
problems involving multiple stakeholders. 

Structured Decision Making Steps 

1. Define the Problem - The first step in a structured 
decision making process is a clear and concise eval¬ 
uation and articulation of the problem being 
addressed and the motivation underlying the need 
to address the problem. Although identifying the 
problem may seem self-evident, failure to clearly 
articulate the problem to all stakeholders and sub¬ 
sequent agreement by stakeholders as to the nature 
of the problem is often cited as the primary reason 
management and policy actions fail, or worse, face 
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future litigation. To facilitate this process, decision 
makers need to ask: 

(a) What specific decision(s) have to be made? 

(b) What is scope of the decision (e.g., geographic, 
temporal)? 

(c) Will the decision be iterated over time? 

(d) What are the constraints within which the deci¬ 
sion will be made (e.g., logistical, ecological, 
legal, temporal, financial)? 

(e) What stakeholders should be involved in the 
decision process and what are their respective 
roles? 

2. Identify the Objectives - The centerpiece of the 
structured decision making process is a set of clearly 
elucidated objectives. Together they define the “why 
do we care” about the decision and thereby facili¬ 
tate the search for alternatives, and become the 
metric for comparing and evaluating management 
outcomes. When defining objectives, there are 
many considerations to ensure that decision makers 
can adequately evaluate alternatives. Ideally, objec¬ 
tives are stated in quantitative terms that relate to 
parameters that can be measured and thus evalu¬ 
ated. More importantly, objectives are meant to 
focus efforts on the importance of the decision in 
a consistent and transparent manner that exposes 
key trade-offs and uncertainties so decision makers 
can generate creative and proactive alternatives. 
Objectives should be complete, controllable, con¬ 
cise, measurable, and understandable [29]. To 
achieve this end requires “brainstorming” with 
stakeholders to identify what is important about 
the decision at hand. The outcome of such an effort 
may produce a wide variety and often extensive list 
of objectives that will need to be simplified to focus 
on things that matter and the direction they need to 
move (e.g., maximize deer harvest or minimize 
erosion). It is important to note, that unlike goals 
or targets, objectives do not have specific quantita¬ 
tive outcomes (e.g., 50% increase), but are meant to 
define the preferred ends and the direction of 
change to meet that ends. 

Once a list of objectives has been defined, it is 
important to separate the objectives into funda¬ 
mental objectives (which reflect the ultimate 
goals) and means objectives (which are ways of 
achieving the ends) to ensure that management 


actions really effect the defined problem. For exam¬ 
ple, “maximize sandbars” may be an important 
objective for the management of a river like the 
Missouri or Platte, but if the river system is being 
managed for wildlife, sandbars are primarily 
important because they increase breeding habitat 
for threatened and endangered terns and plovers. 
“Maximize sandbars” is thus a means objective 
toward reaching the fundamental objective of 
“maximize tern and plover population size.” 
Clearly, there are other means objectives that 
would also facilitate this fundamental objective 
(e.g., minimize nest predation, maximized food 
availability, etc.). The benefit of the process of 
distinguishing objectives is that the identification 
of means objectives can help lead to alternative 
management actions (e.g., build sandbars, release 
reservoir water), while the identification of funda¬ 
mental objectives gives a basis for evaluating and 
comparing alternatives (annual tern and plover 
population size). Keep in mind, however, that the 
status of fundamental or means is not an innate 
quality of an objective, but rather is highly context 
dependent. Thus, what was a means objective for 
one decision, in the example “maximize sandbars,” 
may be a fundamental objective for another if the 
decision problems shifts from say “wildlife manage¬ 
ment” to “aesthetics” or “flow.” 

After developing a careful list of objectives, it 
can be useful to develop a hierarchy or means-ends 
diagram to group similar objectives and clarify the 
links and relationships between means and funda¬ 
mental objectives. An objectives hierarchy can help 
clarify the context of each fundamental objective by 
identifying all the important elements that are 
affected by the decision process and demonstrate 
to stakeholders the importance of all objectives 
even those that are not “fundamental objectives.” 

3. Identify Management Alternatives - Management 
success is only as likely as the creativity and diver¬ 
sity of possible management alternatives. Unfortu¬ 
nately, management paralysis, “pet” management 
actions, and staying with the status quo too often 
limit managers and policy makers to few options 
and thereby impede management success. The pro¬ 
cess of identifying management alternatives, like 
the process of identifying objectives, starts with 
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brainstorming. Identifying alternative management 
actions is a process that should be addressed itera¬ 
tively, as knowledge of best practices and the crea¬ 
tivity to develop novel ideas should not be expected 
to develop instantaneously. The key is bringing the 
“right” people together. It is important to have 
a group with a set of interdisciplinary backgrounds 
that represent the larger decision to ensure that the 
needs of stakeholders are not overlooked. This is 
not to say that the stakeholders involved in identi¬ 
fying alternative management actions are the same 
as the larger stakeholder group, usually they are not. 
This is primarily due to the technical knowledge 
necessary to present plausible alternatives. Still 
there are opportunities where the benefit of being 
naive may present novel actions that might not 
otherwise be considered. 

The brainstorming process should begin by 
identifying alternatives for individual objectives, 
but always be looking for opportunities when one 
action may fulfill the needs of multiple objectives. 
Identifying alternatives also means being mindful 
of those actions that must be done (e.g., standing 
policy), constraints (real or perceived) and poten¬ 
tial trade-offs between objectives and various man¬ 
agement actions. In developing alternatives, it is 
important that the “brainstorming” process focus 
on developing high-quality management actions 
that are: (1) explicitly designed to address the 
outlined objectives, (2) technically sound in that 
they build on the best known practices, (3) concise 
yet comprehensive enough to include the technical 
understanding for implementation, (4) designed to 
expose trade-offs between the decision process by 
having mutually exclusive strategies, and (5) devel¬ 
oped to achieve the greatest good for the stake¬ 
holders involved. 

Once an extensive list of alternatives has been 
identified, it can be useful to group them into 
strategies or portfolios based on general similarities 
in what they aim to achieve. Sometimes these port¬ 
folios can represent the needs of specific stake¬ 
holder groups or specific conditions that could be 
achieved. For example, management actions on 
a river system may be grouped together into port¬ 
folios that meet the needs of sport-fishery, endan¬ 
gered species, or irrigation; alternatively, they may 


be grouped based on their ability to return the river 
to 50%, 75%, or 95% of historical flows. Both 
methods have merit, the first in that it is generally 
clear to the stakeholders what objectives are being 
met and then where trade-offs must be considered, 
and the second in that the inherent interests of any 
particular group are not the driving factor and thus 
the process can be less contentious. 

4. Elucidate Consequences - The list of alternative 
management actions is only effective if it creates 
an opportunity to evaluate and compare actions 
in light of the objectives before implementation. It 
is important to realize that the process of identify¬ 
ing management consequences is not a value judg¬ 
ment, but an analytical assessment of the most 
likely outcome of the action(s). Using the best sci¬ 
entific knowledge available, this process is 
a modeling exercise focused on predicting the likely 
outcomes of each alternative and thus the likeli¬ 
hood that each achieves the desired objective. 
Depending upon our knowledge of the system, 
this process can be highly quantitative where exten¬ 
sive data are modeled and probabilities assigned to 
each outcome or as is often the case, if little or 
nothing is known about the system, this process 
can depend heavily on expert opinion or compari¬ 
sons to similar systems. In both cases, there is 
a degree of uncertainty associated with predicted 
outcomes as well as the parameters included in the 
modeling process. Indeed, because system function 
is rarely precisely understood, the effects of man¬ 
agement actions are never certain and the future 
states are unknown, decisions are almost always 
made in the face of uncertainty. Uncertainty can 
make differentiating among alternatives difficult, 
but because uncertainty is an inherent part of the 
decision process, it must not be ignored. It is 
important that uncertainty be confronted through¬ 
out the decision process and that the uncertainties 
are identified and the possible impacts on the sys¬ 
tem and the ability to achieve stated objectives 
documented. 

Once the modeling process has predicted the 
likely outcomes of each management action and 
the corresponding ability to address each objective, 
the next step is to develop a consequence table. The 
purpose of a consequence table is to produce 
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a visual summary of the consequences of each 
potential management action on each of the objec¬ 
tives in a table or matrix. A consequence table can 
take a variety of forms, from a simple rating system 
(e.g., consumer report 5-star rating) to a complex 
table with specific probabilities of outcomes and 
subsequent likelihoods of achieving each objective. 
Independent of the complexity of the underlying 
models that populate the matrix, the purpose of the 
consequence table is to ease and facilitate direct 
comparison of each management actions’ ability 
to achieve each objective. 

5. Identify and Evaluate Tradeoffs - Ideally the struc¬ 
tured decision making process would lead to a clear 
management alternative that achieves the objectives 
of all interested parties; unfortunately, this is rarely 
the case. Generally, the process of developing 
a consequence table will clearly elucidate which 
options are the least likely to be effective, but if 
there are multiple stakeholders and thus multiple 
objectives, most decisions will require a trade-off 
between the ability of the remaining options to 
achieve each objective. The process of identifying 
where these trade-offs arise is analytical, but the 
decision process itself is highly value laden and 
thus dependent upon stakeholders. In most com¬ 
plex decisions, this will involve stakeholders choos¬ 
ing between less-than-perfect alternatives. There 
are a variety of methods to facilitate highly value¬ 
laden decisions by weighing options based on the 
values of the stakeholders and then comparing 
alternatives to find the “best” compromise solu¬ 
tions. However, trade-offs are real and it is unlikely 
that all parties will be totally satisfied with the 
eventual outcome. Indeed, although consensus is 
ideal, it is not necessary and is often unachievable; 
however, the benefit of the structured decision¬ 
making process is that even if there is disagreement, 
the process makes the disagreement transparent 
and enables stakeholders to re-evaluate using new 
knowledge and/or perspectives. 

6. Implement Management Action - The final step in 
the structured decision-making process is imple¬ 
mentation. Although this may always seem to be 
the desired outcome of a decision process, unfor¬ 
tunately, social and political pressures to reach 
“perfection” often impede implementation and 


leave decisions in a continuous state of inaction. 
To ensure success, managers, policy makers, and 
stakeholders must work together to move through 
the decision process in a timely manner to ensure 
action can be taken. Failure to take action is 
a decision, whether it is made passively or actively. 

Participatory Active Adaptive Management 

Panarchy is a useful model for characterizing ecological 
systems and the formal institutions that manage these 
systems [30]. One of the most critical aspects in the 
panarchy appears to be a bridging organization that can 
monitor the status of the social-ecological system, and 
manifest rapid change, if conditions are deteriorating 
[31] . Monitoring will allow for management to set new 
target levels, and modify policy to reach those target 
levels, as new information is generated on scale-specific 
system attributes [32]. In order for management 
entities operating at discrete scales to improve commu¬ 
nication channels and create opportunities for collab¬ 
oration, intermediate level entities may serve to 
facilitate these cross-scale linkages. Bridging organiza¬ 
tions have the capacity to fulfill this role and organize 
cooperation between stakeholders across scales [33], 
but to do so successfully, one must formulate strategies, 
coordinate joint action, address uncertainty, and link 
diverse stakeholders in a world of increasing complex¬ 
ity. Brown [33] investigated bridging organizations 
from across the world, and from a variety of scopes 
(e.g., regional economic policy in the USA; small-scale 
irrigation projects in Indonesia; agricultural productiv¬ 
ity in Zimbabwe) found that bridging organizations are 
independent of stakeholders in a social-ecological sys¬ 
tem, which allows them to negotiate with stakeholders 
and advocate multiple positions. This unique role in 
the management of social-ecological systems affords 
bridging organizations the capacity to catalyze the for¬ 
mation of policies that are flexible and reflective of the 
panarchy of ecosystems and institutions [33]. In addi¬ 
tion, bridging organizations have the capacity to reduce 
transaction costs, and provide a mechanism to enforce 
adherence to desired policies, despite their lack of reg¬ 
ulatory authority [34]. 

Examples of bridging organizations include: 
(1) assessment teams, which are made up of actors 
across sectors in a social-ecological system; 
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(2) nongovernmental organizations, which create an 
arena for trust-building, learning, conflict resolution, 
and adaptive co-management; and (3) the scientific 
community, which acts as a “watchdog,” as well as 
a facilitator, for adaptive management. For pur¬ 
poses of environmental management, an example 
of a successful bridging organization is that of 
Ekomuseum Kristianstads Vattenrike (EKV), a small, 
municipal organization that facilitated progressive eco¬ 
system management in southern Sweden [34]. EKV 
was tasked with managing water resources at 
a regional scale in Sweden, and was successful largely 
because it employed organizational flexibility that 
allowed for EKV to respond quickly to “surprise.” 
This was achieved through leadership, a core interdis¬ 
ciplinary staff, and the facilitation of connections 
between individuals and organizations (i.e., the 
panarchy of institutions) in the social-ecological sys¬ 
tem. EKV was able to improve the social capacity to 
respond to “surprises” and create the trust necessary to 
push the social-ecological system toward improved 
adaptive management of resources. 

The formal management institutions in place are 
likely to persist barring a large-scale perturbation to 
social-ecological systems. So, managers must operate 
within the limitations of these institutions, which com¬ 
plicates matters, but does not make the situation intrac¬ 
table. One possible option for improving environmental 
management, as highlighted in this section, appears to 
be in developing bridging organizations that catalyze 
cross-scale communication across the panarchy of insti¬ 
tutions and ecosystems, and explicit recognition of the 
underlying cross-scale structure and nonlinear interac¬ 
tions of these linked systems, by both policy and policy 
makers. The lack of communication and cooperation 
between institutions at even small scales further illumi¬ 
nates that bridging organizations may help bring about 
effective management of natural resources at multiple 
scales [35]. Thus, bridging organizations should act 
as mini think tanks that facilitate communication 
between institutions, incubate new ideas for environ¬ 
mental management, and provide a forum for coming 
to agreements on contentious issues [36]. 

Bridging organizations play a critical role in facili¬ 
tating adaptive comanagement and governance, and are 
essential to managing for resilience in social-ecological 
systems [37]. Perception of a particular policy can play 


a significant role in whether it is accepted by critical 
stakeholders in a social-ecological system [38]. Engag¬ 
ing stakeholders, implementing change at a suitable rate, 
and providing outreach to keep the public informed 
are all important for new environmental policy to be 
perceived of as positive and for a successful transition 
to a new policy regime [38]. This environmental man¬ 
agement framework, which incorporates panarchy, 
adaptive management, and bridging organizations, 
could serve as one scenario in the suite of policy 
options for actualizing sustainability [30]. 

Adaptive Governance 

Administrative agencies typically change incrementally 
[39], and as such, changes in policy are small because 
there is not enough information to make large over¬ 
hauls of organization policy. Standard operating pro¬ 
cedures are another mechanism that contributes to 
organizational inertia, as they slow the bureaucratic 
process [40]. Further, the lack of institutions matched 
to the appropriate scale is a significant barrier for 
sound environmental management [41]. Within this 
context, adaptive governance can help with this scale 
mismatch via collaboration of a diverse set of stake¬ 
holders at multiple scales [42]. Adaptive governance is 
a form of governance that incorporates formal institu¬ 
tions, informal groups/networks, and individuals at 
multiple scales for purposes of collaborative environ¬ 
mental management [43]. Bridging organizations, 
enabling legislation and government policies can also 
contribute to the success of an adaptive governance 
framework; governance creates a vision and manage¬ 
ment actualizes the vision [43]. 

Adaptive governance works via sharing of manage¬ 
ment power and responsibilities, and promotes 
a collaborative, participatory process, but is dependent 
upon adaptive comanagement, and adaptive 
comanagement is dependent upon social networks for 
success. Social networks have the capacity to allow for 
development of new ideas, to facilitate communication 
between entities, and to create the flexibility necessary 
for the interplay of the fluid (ecological systems) and the 
rigid (institutions) to be successful for environmental 
management [43]. Leadership has been well established 
as a critical factor in facilitating good environmental 
management. Leaders develop and facilitate a vision for 
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environmental management, incorporating local 
knowledge and information from social networks [43]. 

Olsson et al. [44] studied adaptive comanagement 
in Sweden and Canada and concluded that this form of 
management of ecological systems was most effective 
when there was: leadership with vision for the system of 
interest; legislation that created the environment for 
adaptive management; funds for adaptive manage¬ 
ment; monitoring of the ecological system; informa¬ 
tion flow (i.e., cross-scale linkages); combination of 
a variety of sources knowledge; and venue for collabo¬ 
ration. Olsson et al. [44] contend that these factors are 
critical to building resilience in social-ecological sys¬ 
tems, as they help to protect the system from the failure 
of management decisions under uncertainty (i.e., 
imperfect information). Further, they assert that adap¬ 
tive comanagement is necessary to facilitate adaptive 
governance. In turn, adaptive governance is facilitated 
by informal networks and leadership, which creates the 
capacity for development of novel ideas for environ¬ 
mental management [43]. These informal networks 
have the capacity to generate political, financial, and 
legal support for novel environmental management 
[43]. Further, adaptive governance is dependent upon 
polycentric institutions that are redundant (e.g., scale- 
specific) and are quasi-autonomous [45]. Olsson et al. 
[45] compared five case studies from around the world 
and concluded that in order for a social-ecological 
system to transition to adaptive governance, it must 
undergo a preparation and a transformation phase, 
linked by a window of opportunity. 

In a well-cited example (Kristianstads Vattenrike) 
from Sweden, Olsson et al. [45] report the transition to 
adaptive governance was preceded by the develop¬ 
ment of a social network of parties interested in the 
management of the social-ecological system. The net¬ 
work consisted of members from local groups (envi¬ 
ronmental groups, farmers’ associations), local 
government (municipality of Kristianstad, the County 
Administrative Board), and national scale (World 
Wildlife Fund, National Museum of Natural History, 
National Research Council). In case studies that have 
not resulted in a successful transition to adaptive 
governance, the social networks needed to help facil¬ 
itate the transition were not well developed, and this 
hindered the changes needed for good environmental 
management [45]. 


The role of leadership has also been cited as critical to 
a transition to adaptive governance, and Olsson et al. 
[45] provide an example of leadership from 
Kristianstads Vattenrike. A key individual acted as 
a catalyst to social network formation, setting the 
research agenda, and mobilizing support at multiple 
scales for “new” environmental management. Critical 
to setting an agenda is defining how an issue becomes 
perceived as a “public problem because if most individ¬ 
uals accept a particular condition, negative feedback 
works to maintain public opinion in that particular 
regime” [46]. However, if the individuals in the regime 
develop a “critical mass” of distaste for a particular issue, 
public opinion can cross a threshold and reorganize into 
an alternative regime. Importantly, interest groups, the 
media, and other agents can have an effect on agenda 
setting and creating the “climate” necessary for a shift in 
public opinion [46] . There are critical roles to be played 
by individual actors in shifting policy from one regime 
to an alternate regime. For instance, social networkers 
that share information freely; individuals that have 
numerous, diverse connections; and individuals with 
powerful ability to persuade play key roles in policy 
change [47]. These individuals can interact to create 
the conditions necessary for regime shifts in public 
policy. In particular, the director of a municipal orga¬ 
nization (Ekomuseum Kristianstads Vattenrike) filled 
this leadership role and served as a bridging organiza¬ 
tion that also was a significant factor in the transition to 
adaptive governance [45]. The leadership needed to 
foster a transition to adaptive governance is not neces¬ 
sarily the work of one individual, but rather is often 
encompassed by several individuals and entities [48]. 

There are two types of policy windows: a problem- 
driven window and a politically driven window [49]. 
A problem-driven window opens when a policymaker 
believes that a policy is necessary for a specific issue. 
A politically driven window is driven by a particular 
theme adopted by a policymaker, in which the 
policymaker looks for problems that fit within the 
theme. Significant changes in policy occur when con¬ 
ditions (e.g., problems, solutions, and politics) con¬ 
verge at the same time, which creates the window of 
opportunity for change [49]. In the Kristianstads 
Vattenrike example, social and ecological change at 
one scale triggered cross-scale effects which resulted 
in a window of opportunity for the transition to 
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adaptive governance [45]. In adaptive governance, 
decision making is not top-down but rather emerges 
from outreach and group meetings with stakeholders 
[50]. In order for adaptive governance to be effective, 
the policy requires strong leadership, communication, 
and incorporation of uncertainty, which allows for 
adaptation to changing circumstances [50]. 

Adaptive Management and Law 

Legal certainty is an aspect of law that does not mesh 
well with environmental unpredictability. One of the 
most significant barriers for managing linked social- 
ecological systems is that often the aspects of a society 
that make it free (e.g., certainty of law) are not in 
concert with ecological realities (e.g., multi-regimes, 
nonlinear systems, and responses) [51]. The certainty 
of law and institutional rigidity often limit experimen¬ 
tation that is necessary for adaptive management [30]. 
This point is critical, as some scholars contend that 
environmental governance of the commons can only 
succeed if rules evolve with the system of interest [41]. 

Ecosystem management has been applied within 
the outdated framework of the Endangered Species 
Act (ESA), but ecosystem management is best 
implemented via adaptive management [52]. In its 
current form, the ESA does not have the necessary 
flexibility in its regulatory language to effectively 
implement adaptive responses to changing environ¬ 
mental conditions [52]. The legal constraints upon 
adaptive management in the American system of law 
do not stop there. The fundamental constraint to adap¬ 
tive management is the current state of administrative 
law [53]. As the law now stands, the procedural rules 
require a vast amount of work before an agency 
promulgates a rule or issues a permit [54]. This 
“pre-decision” activity allows for public input and pre¬ 
pares agencies for judicial review. Ruhl [54] contends 
that “agencies will find that interest groups and courts 
relentlessly will erode adaptive agency behavior, using 
all the tools conventional administrative law puts at 
their disposal.” Having to operate in an atmosphere 
where each policy is evaluated on the “front-end,” in 
anticipation of public and legal scrutiny, has squelched 
agencies’ appetite for adaptive management. 

US administrative law is a two-step process, in 
which the first step allows for public comment on 


draft documents and alternative options [55]. The sec¬ 
ond step is final agency action, which creates “cer¬ 
tainty” to the process and makes the decision subject 
to judicial review. This process is based on the assump¬ 
tion that agencies have the capacity to predict the 
consequences of a “final agency action” [55]. Thus, 
there is a fundamental conflict between linear legal 
processes (i.e., administrative law) based on 
“stationarity,” versus environmental management 
frameworks (i.e., adaptive management) based on the 
realization of dynamic systems characterized by “sur¬ 
prise” [55]. Given this inherent conflict, adaptive man¬ 
agement may not be possible under the current 
administrative law framework [54]. 

The adversarial character of administrative law, 
combined with the need for certainty (e.g., procedural 
rules) in the larger realm of American law, is likely 
incompatible with adaptive management [56]. Thus, 
environmental law is at odds with science, as the cer¬ 
tainty required for socio-political stability makes it very 
difficult to apply a novel approach to ecosystem man¬ 
agement (e.g., adaptive management) that requires 
institutional flexibility. Thus, if adaptive management 
is necessary for good environmental management, 
environmental law must be “adapted” to fit with adap¬ 
tive management [54]. Karkkainen [56] argues that 
administrative law should proceed on two trajectories: 
(1) a fixed rule track that will apply unless an agency 
can justify otherwise; and (2) an adaptive management 
track, where a new set of administrative law standards 
specific to adaptive management would hold prece¬ 
dence, in order to actualize adaptive management as 
a tool for environmental policy. 

Thus, some in the law community argue that adap¬ 
tive management is not possible under the current 
administrative law framework [54]. The National Envi¬ 
ronmental Policy Act (NEPA) may act as a barrier to 
implementation of adaptive management (sensu 
Holling) [57]. NEPA could possibly be modified to an 
iterative process that could accommodate adaptive 
management [57]. Ruhl [54] contends that adaptive 
management is necessary for good environmental man¬ 
agement, which in turn means that environmental law 
must be “adapted” to fit with adaptive management. 

In effect, administrative agencies in the USA do not 
conduct adaptive management as it was originally con¬ 
ceived [55]. Rather, agencies conduct adaptive 
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management “lite,” as the courts have provided some 
leeway to adaptive management projects, provided 
they have requirements that are legally enforceable 
[55] . The primary problem with adaptive management 
“lite” is that it does not measure up to the standards of 
adaptive management theory, nor does it hold up 
under the scrutiny of substantive and procedural law. 
Adaptive management (sensu Holling) is not likely 
until Congress provides more funding for adaptive 
management and clear standards for the adaptive man¬ 
agement process [55]. 

Conclusions 

The conceptual underpinnings for adaptive manage¬ 
ment are simple; there will always be inherent uncer¬ 
tainty and unpredictability in the dynamics and 
behavior of complex ecological systems as a result of 
nonlinear interactions among components and emer¬ 
gence, yet management decisions must still be made. 
The strength of adaptive management is in the recog¬ 
nition and confrontation of such uncertainty. Rather 
than ignore uncertainty, or use it to preclude manage¬ 
ment actions, adaptive management can foster resil¬ 
ience and flexibility to cope with an uncertain future, 
and develop safe-to-fail management approaches that 
acknowledge inevitable changes and surprises. Since 
its initial introduction, adaptive management has 
been hailed as a solution to endless trial and error 
approaches to complex natural resource management 
challenges. However, it does not produce easy answers, 
and it is appropriate in only a subset of natural resource 
management problems. Clearly adaptive management 
has great potential when applied appropriately. 

Future Directions 

Adaptive management is increasingly heralded as the 
future of natural resource management and has been 
adopted by many governmental and nongovernmental 
agencies. Institutions adopting adaptive management 
have utilized different definitions often focusing on 
a single strength of the process (i.e., experimentation, 
reducing uncertainty, involving stakeholders) and thus 
operationalize the practice uniquely. Some, like the 
U.S. Department of Interior, are highly focused on the 
decision process and the incorporation of structured 


decision making while others, such as the US Army 
Corps of Engineers, have embraced stakeholder 
involvement. Each approach has merit but adaptive 
management has failed to live up to its expectations 
[58]. The reasons for failure are many, and likely to be 
repeated, yet the great potential of adaptive manage¬ 
ment remains; unfortunately, it remains largely 
untapped. Translation of adaptive management 
approaches to “on-the-ground” natural resource man¬ 
agers is a critical step that has largely failed. Most natural 
resource managers are still unable to define adaptive 
management, let alone incorporate it into their normal 
management activities. The next decade will be critical: 
Will adaptive management remain in the domain of 
ivory towers, or will it become a tool for the trenches? 
Taking adaptive management to the practitioners will 
require the communication of adaptive management 
techniques in a clear, simple, and most importantly 
applicable manner. Currently, adaptive management 
fails because of an adherence to mathematical modeling 
above all else, its application to situations that are not 
conducive to replication or the measurement of success 
(e.g., large rivers such as the Missouri or the Colorado), 
and because adaptive management has not been ade¬ 
quately incorporated into natural resources manage¬ 
ment via appropriate legal mechanisms [59]. If the 
future of natural resource management is to be proac¬ 
tive and address the increasing uncertainties facing our 
world, adaptive approaches to resource management 
will require communication of the methodology and 
merits in a clear and simple manner. 
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Glossary 

Beta-diversity Also termed species turnover, beta- 
diversity refers to the change in species as one 
moves between habitats, communities, or 
ecosystems. 

Divergence-with-gene-flow model of speciation A 

model explaining the process of species formation 
(speciation) in which new species diverge in the face 
of gene flow; the movement of genes within a group 
that results from mating with immigrant 
individuals. 
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Ecotone A transition zone between two or more 
different ecological communities or regions. 
Ecotone effect The pattern of increased species 
richness (number of species) and abundance in 
ecotones and the occurrence of unique ecotonal 
species. 

Edge effect The effect of the juxtaposition of 
contrasting environments on an ecosystem. 
Geographic information systems (GIS) A computer- 
based system for creating and managing spatial data 
and associated attributes. It enables the capture, 
storage, retrieval, analysis, and display of spatial 
(location-based) data. 

Remote sensing The science and art of obtaining 
information about an object, area, or phenomenon 
through the analysis of data acquired by a device 
that is not in contact with the object, area, or 
phenomenon under investigation (e.g., via a satel¬ 
lite image). 

Definition of the Subject 

Ecotones are areas of transition between ecological 
communities, ecosystems, or ecological regions (such 
as Mediterranean and desert). Ecotones often occur 
along ecological gradients. Such gradients are created 
as a result of spatial shifts in elevation, climate, soil, and 
many other environmental factors. Ecotones com¬ 
monly coincide with areas of sharp climatic transition 
along environmental gradients. They occur at multiple 
spatial scales, from continental-scale transitions 
between major biomes to small-scale ecotones where 
local vegetation communities and microhabitats coin¬ 
cide. They show a diversity of boundary types that 
range from natural boundaries (e.g., altitudinal, latitu¬ 
dinal transitions) to human-generated ecotones (e.g., 
forest clear-cut edges or urban ecotones). Ecotones 
have been studied in the past four decades in an 
ecological context and in recent years are receiving 
increasing attention in the context of biodiversity 
conservation. Various studies have shown that species 
richness and abundances tend to peak in ecotonal 
areas, though exceptions to these patterns occur. 
Ecotones are “natural laboratories” for studying 
a range of evolutionary processes, such as the process 
by which new species form, also termed speciation. 
Some researchers argue that ecotones deserve 


high-conservation investment, potentially serving as 
speciation and biodiversity centers. Because ecotones 
are often small in size and relatively rich in biodiversity, 
conservation efforts in these areas may prove to be an 
efficient and cost-effective conservation strategy. 

Introduction 

While substantial research in the past decades has 
focused on distinct ecological regions, communities, 
and ecosystems, areas of transition between these 
regions have received much less attention. These 
areas, nevertheless, sustain rich and unique biodiversity 
and may have an important role in sustaining future 
biodiversity. However, studies have shown that species 
richness and abundances tend to peak in ecotonal 
areas, though exceptions to this pattern occur. This is 
likely because ecotonal areas hold species from two or 
more neighboring communities and possibly due to 
high spatial heterogeneity and high speciation rates in 
ecotonal areas. In addition, ecotones have been shown, 
in some cases, to hold ecotonal species and genotypes 
that are uniquely characteristic of the ecotone. 

Several studies have found evidence that some spe¬ 
cies are diverging in ecotonal regions, suggesting that 
transitional areas may serve as speciation centers. 
Recent work has shown increased genetic and morpho¬ 
logical diversity in ecotonal regions compared with 
neighboring areas. Ecotonal populations often coincide 
with peripheral populations along a species range. 
Some researchers suggest that because ecotones often 
hold marginal populations that depend on other parts 
of the range for the maintenance of genetic diversity, 
they do not deserve much conservation interest. Alter¬ 
natively, in the face of global climate and environmen¬ 
tal changes, it has also been argued that ecotones 
deserve special conservation investment, potentially 
being speciation and biodiversity hotspots that hold 
relatively resilient populations. This entry will provide 
a short overview of the history of ecotone research, 
recent novel findings in the area and the implications 
for future conservation, biodiversity, and sustainability 
research. 

Conceptual Framework 

Ecotones are areas where ecological communities, 
ecosystems, or biotic regions coincide. They often 
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occur in areas of steep environmental transition, along 
environmental gradients. In these transitional regions, 
the environment rapidly shifts from one type to 
another based on abiotic (e.g., climatic) and/or biotic 
(e.g., community structure) factors [13, 19]. The origin 
of the word “ecotone” is in the Greek roots “oikos” 
(home) and “tonus” (tension). Many different defini¬ 
tions and terms have been used in the literature to 
describe ecotones and areas of ecological transition. 
These include boundary regions, borders, meeting 
zones, transitional zones, tension zones, zones of 
intermingling, and zones of transgression [19]. The 
definition often contrasts ecotones with more homog¬ 
enous areas found on both sides of the transition or to 
the landscape as a whole. Ecotones can occur in both 
terrestrial and aquatic systems, and cover several spatial 
scales, from large spatial-scale ecotones, where biomes 
meet [40] to local-scale transitions, such as mountain 
treelines ([11]; see Fig. 1). These areas are sometimes 
considered to be dynamic zones of interaction between 
communities, which are unstable over time [19]. As 
suggested by Odum [30], Ecotones do not simply rep¬ 
resent a boundary or an edge; the concept of an ecotone 
assumes the existence of active interaction between two 
or more ecosystems with properties that do not exist in 
either of the adjacent ecosystems. Ecotonal regions 
show a diversity of transitions that range from natural 
transitions (e.g., altitudinal and latitudinal transitions) 
to human-generated ecotones, sometimes termed 
anthropogenic ecotones (e.g., forest clear-cut edges or 
urban ecotones) [24], as shown in Fig. 1. 

History of Research on Ecotones and Biodiversity 

For over 100 years, since the late nineteenth century, 
there has been scientific interest in boundary regions 
and edges of ecological systems. A conceptual ecologi¬ 
cal framework for the study of ecotones was given 
by Odum in 1953. Odum discussed the ecotone effect, 
which he characterized as increased richness and 
abundance in ecotones and the occurrence of unique 
ecotonal species. Until the 1970s, there was consider¬ 
able interest in ecotones within the scientific commu¬ 
nity (reviewed by [32]). This interest subsided as focus 
on more homogenous and well-defined ecosystems and 
communities (e.g., tropical rain forests and tundra) 
became common. A revival of research in the field 


focusing on ecotones and their effects on biodiversity 
was seen in the late 1980s and 1990s, with the develop¬ 
ment of new research areas, especially those of land¬ 
scape ecology and of conservation biology [11, 13]. 
Studies on ecotones in the 1980s often focused 
on material flow (e.g., water and nutrients) across 
communities and on ecosystem processes in these 
boundary regions [12]. Much of the work focused on 
wetlands and on riparian zones, where land-water 
interfaces occur (reviewed in [45]). Eater work in the 
1990s more directly examined the effect of ecotones on 
biological diversity, and especially on the relationship 
between ecotones and processes leading to morpholog¬ 
ical divergence, patterns of genetic and phenotypic 
diversity, species richness, rarity, and their conserva¬ 
tion implications (reviewed in [17]). 

Approaches for Measuring Ecotones 

Due to the fact that ecotones can be rarely delimited by 
a fine line, their measurement and mapping is not 
simple. A wide range of research approaches and tools 
have been used to detect and quantify ecotones. These 
include, among others, simulation modeling, geo¬ 
graphic information systems (GIS), remote sensing, 
and statistical tools that enable quantification and anal¬ 
ysis of ecotones of different types and over several 
spatial scales. Diverse approaches for the quantification 
of the steepness of gradients exist [17]. Methods for 
measuring and characterizing ecotones depend on the 
data available (e.g., quantitative or qualitative, grid- or 
transect-based data), with one of the simplest 
approaches, proposed by Womble in 1951, being the 
quantification of the magnitude of the first and second 
derivatives (rates of change in a given variable or 
several variables along a spatial gradient) [44] . These 
approaches often examine the values of variables in an 
area (e.g., a 1 x 1 km grid square) relative to its 
neighboring regions. The basic idea is to detect areas 
of sharp environmental transition by finding the areas 
with the highest rate of change in the value of a given 
variable or several variables between adjacent squares 
(pixels). Specific software for the detection of bound¬ 
ary regions and analysis is now available (e.g., 
BoundarySeer: http://www.terraseer.com/products_ 
boundaryseer.php), enabling more widespread use of 
advanced statistical tools for the study of areas of 
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Ecotones and Ecological Gradients. Figure 1 

Satellite images showing a variety of natural and human-generated ecotones at several spatial scales: (a) Ecotones in the 
Sahel region of Africa (see text). (b)Altitudinal ecotones between vegetation belts in Mt. Kenya, (c) Natural and human- 
made ecotones on Hawaii's big island. Ecotones vary over different slopes and elevations. Note also the sharper ecotones 
in areas where lava has flowed, (d) Human-related ecotones resulting from deforestation (in pink) in the Amazon Basin, 
Brazil. On the far left and far right, in dark green, natural riparian ecotones can be detected along the rivers. (Source of 
Fig. a Stockli et al. [39] NASA Blue Marble Next Generation, August 2004. True color composite of MODIS satellite images. 
Downloaded from: http://visibleearth.nasa.gov/ [42].) (Source of Fig. b-d MDA Federal [25] NASA GeoCover circa 1990 
Landsat images. False color composite of Landsat bands 7, 4 and 2. Green shades represent vegetation, magenta and 
brown represent bare soil, black represent water bodies, recent lava flows of shadows, cyan shades represent snow, and 
white represents clouds. Downloaded from: https://zulu.ssc.nasa.gov/mrsid/ [3].) (All images are public domain. The 
figures were produced by Noam Levin) 


transition [5, 8]. These tools can also be applied to the 
study of ecotonal regions. 

In recent years, new approaches to quantify changes 
in diversity across gradients and boundary regions have 
been developed and are being applied. Among these is 


a range of new beta-diversity estimates of species turn¬ 
over in space [20, 26] . These have been developed in the 
past decades, since Wilson and Shmidas [43] review on 
beta-diversity estimates. Beta-diversity and species 
turnover are often used when studying gradients, and 
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although they do not focus necessarily on ecotonal 
areas, they can be applied to the study of ecotones. 

One of the most promising directions in ecotone 
and boundary measurement is the use of tools devel¬ 
oped in other areas of science. These include fields such 
as physics, remote sensing, and image analysis, where 
substantial advancements in boundary detection and 
gradient quantification methods have been made. 
Remote-sensing tools, for example, can use data occur¬ 
ring over several orders of magnitude, using satellite- 
derived data currently available at a resolution of 0.5 m 
to 100 km. Further application of these tools at multi¬ 
ple spatial resolutions will provide a better understand¬ 
ing of ecotones. Recent work has shown that remote 
sensing tools can be effectively used to detect ecotones 
and to predict species richness and rarity (e.g., range 
size rarity) in ecotones, especially in mountains where 
latitudinal gradients occur (e.g., [22, 23]). 


more bird species, and also scored more highly in terms 
of species rarity. The findings of their work suggest that 
transitional environments harbor many rare species, in 
addition to high richness. At the community level, 
there is also some evidence for high species richness 
in ecotonal areas in marine systems. For example, van 
Rensburg et al. [41] showed at the subcontinental scale 
in South Africa that species richness and range size 
rarity at a spatial resolution of quarter degree are gen¬ 
erally negatively correlated with distance to areas of 
transition areas between vegetation communities for 
both birds and frogs. Areas with more range-restricted 
species were located significantly closer to areas of 
transition areas between vegetation communities than 
expected by chance [41] . Similarly, in the Gulf of Aden, 
Kemp and colleagues [18] found high-reef fish diversity 
in an ecotone harboring a unique mixing of the three 
distinct faunas of Oman, the Red Sea, and the Indian 
Ocean. 


E 


Patterns of Biodiversity in Ecotones 

There have been multiple studies on patterns of biodi¬ 
versity in ecotonal areas, which have led to a range of 
results. Recent work is providing increasing evidence 
that boundary regions between ecological communities 
can be highly diverse at both the within-species and 
community levels. Ecotones have been shown to hold 
especially high biological diversity over several spatial 
scales, at both the community level (when examining 
species richness, i.e., the number of species in an area) 
and at the within-species level (morphological and 
genetic diversity) as reviewed by Kark and van 
Rensburg [17]. Other studies, however, have shown 
conflicting results, making it difficult to generalize 
without carefully examining each case, community, 
and region. 

Early on, Odum [30] pointed at high species rich¬ 
ness and abundance in ecotones and suggested that the 
ecotonal community commonly contains many species 
that are characteristic of, and sometimes restricted to, 
the ecotone. In a continental-scale study of New World 
birds, Kark et al. [16] examined the relationship 
between passerine richness and rarity of ~2,300 
passeriform species in 4,889 one-degree New World 
grid cells and the distance of the cells to boundaries 
between adjacent plant-based ecoregions. They found 
that areas nearer to transitions between ecoregions had 


Processes Shaping Biodiversity and 
Sustainability in Ecotones 

Ecotones and Evolutionary Processes 

Ecotones are “natural laboratories” for studying 
a range of evolutionary processes, such as the process 
by which new species form, also termed speciation. 
This process is of major interest to evolutionary biolo¬ 
gists who define three major types of speciation: allo- 
patric, parapatric, and sympatric. These models are 
based on the degree of geographical subdivision 
between populations that lead to the formation of 
new species. Allopatric speciation happens in geo¬ 
graphical isolation, and has been for many years 
considered the major form of speciation. Parapatric 
speciation occurs in adjacent populations with gene 
flow among them, often along dines. Sympatric 
speciation occurs when populations are geographically 
congruent, and are found in the same area. The study of 
ecotones has led to a better understanding of the poten¬ 
tial importance of parapatric and sympatric speciation 
as mechanisms for speciation. Ecotones have been 
proposed by some authors to be centers of evolutionary 
novelty that maintain evolutionary process, and as 
regions where parapatric (or sympatric) speciation 
processes may take place [33]. As such, ecotones and 
areas of environmental transition have been suggested 
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as natural laboratories where evolutionary processes 
and barriers to gene flow can be examined [33]. 

A review by Moritz et al. [27] summarized the 
major models of evolutionary processes that promote 
diversification of rain forest faunas. They include the 
divergence-with-gene-flow model, which suggests that 
adaptive divergence caused by selection forces occurs 
across environmental gradients, leading to speciation 
even in the face of gene flow across ecotones. This 
means that speciation does not require isolation in 
cases where selection is strong enough to separate 
populations. This process is expected to occur espe¬ 
cially where very different environments meet in the 
ecotone, for example, at the border between a dry and 
wet rain forest. This may mean that even when gene 
flow continues, strong selection pressures can lead to 
divergence. 

Support for the divergence-with-gene-flow model 
comes from recent research examining divergence 
using molecular genetic, phenotypic, and experimental 
approaches [27, 33, 34, 37]. Smith et al. [37], studying 
the little greenbul ( Andropadus virens ), a passeriform 
bird in the rain forest-savannah ecotone region of 
Cameroon, found especially high morphological diver¬ 
gence in the ecotone. The authors proposed that when 
the ecotone is large enough, natural selection processes 
could be strong enough to generate morphological 
differences similar to those seen in reproductively iso¬ 
lated species even when high rates of gene flow occur. 
Their data support the divergence-with-gene-flow 
model of speciation [38], leading them to propose 
that ecotones may be integral to the production and 
maintenance of high biodiversity in tropical rain for¬ 
ests. Quantification of morphological and geographic 
distances in olive sunbird ( Nectarinia olivacea ) 
populations in West African forests and ecotones 
revealed similar divergence patterns [38]. 

Smith studied morphological divergence in another 
Central African species, the black-bellied seedcracker 
(Pyrenestes ostrinus). Seedcrackers show polymor¬ 
phism in bill size. A megabilled morph was found in 
the ecotone that specializes on a very hard-seeded sedge 
found only in ecotonal areas [27]. This ecotonal 
megamorph was maintained in the population, despite 
high levels of gene flow with rain forest populations 
that had only smaller-billed morphs, owing to its selec¬ 
tive advantage for feeding on the hard seeds. 


Morphological divergence between habitats across an 
ecotone was also found in leaf-litter skinks ( Carlia 
rubigularis ) in the wet tropical rain forest of Australia 
[34]. Adult skinks occurring across sharp ecotones 
from open (wet sclerophyll) forests to adjacent rain 
forests showed large morphological and life history 
differences over short distances despite moderate to 
high levels of mitochondrial gene flow [34]. 
Populations occurring across the ecotone had larger 
size differences than populations located dozens to 
hundreds of kilometers away that were geographically 
isolated millions of years ago, suggesting that in this 
case, and perhaps in others that await research, speci¬ 
ation with gene flow may have importance. 

If ecotones regions harbor unique and endemic 
species and alleles, this may provide support for the 
notion that these regions may also have importance in 
speciation. If this is the case, ecotonal regions are 
expected to contain a preponderance of recently 
derived species that are yet to expand their ranges 
(neoendemics). Fjeldsa and Rahbek [7] suggested that 
more recently evolved species are concentrated in tran¬ 
sitional ecotones surrounding the main central African 
rain forest. This reasoning is consistent with the finding 
that terrestrial ecotones sustain high morphological 
divergence, providing evidence that current speciation 
processes may indeed be taking place in these regions 
[27, 33, 37]. 

Ecotones, Ecological Gradients, and Species Ranges 

One of the important questions that arise is what are 
the processes that lead to higher species richness in 
ecotonal regions compared with adjacent areas? Evolu¬ 
tionary processes, as discussed above, may be one 
explanation. Ecological factors may also have impor¬ 
tance in shaping this pattern. Several authors, such as 
Gosz [11] and Risser [32], have suggested that transi¬ 
tional areas not only share the two types of environ¬ 
ments of the habitats that coincide in the ecotone, but 
also have a unique ecotonal environment. Indeed, 
Odum [30] proposed that transition zones often 
support a unique community with characteristics addi¬ 
tional to those of the communities that adjoin the 
ecotone, although also commenting this is by no 
means a universal phenomenon. Studies testing these 
predictions show mixed results, some pointing toward 
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the occurrence of ecotonal species, while others not 
finding evidence for species unique and highly abun¬ 
dant in ecotones. The inconsistency among studies is 
complicated by the fact that different species, systems, 
scales, and regions were used in different studies or 
due to methodological factors, such as differences in 
sampling and analysis approaches. 

In addition, ecotones tend to shift in space and time 
over several spatial scales [11, 19], as a response to 
climatic variation, other environmental changes [6, 
19, 29], and human activity [10]. Ecotones show high 
spatial and temporal heterogeneity, which may serve as 
important factors contributing to their high genetic 
and species diversity [32]. For example, multiple eco¬ 
tones can be defined within and around the African 
Sahel (Fig. la), depending on the scale of interest and 
on the definitions used [1]. The different transitions 
(e.g., that between the Sahel and the desert to the 
north) experience shifts in time and space, showing 
high spatiotemporal variability. Another, simpler, pro¬ 
cess shaping this pattern is that ecotones, comprising 
meeting areas between adjoining communities, include 
a combination of species from two or more community 
types [32]. Ecotonal areas often comprise the edge of 
the range for species on both sides and are where many 
peripheral populations occur [17, 35]. 

An important question is whether populations 
occurring in ecotonal areas are viable populations 
that exist over time within ecotones, or are rather 
present temporarily due to the constant flow of indi¬ 
viduals from other parts of their range into the ecotone 
areas, are not self-sustainable over time, and will dis¬ 
appear if this flow is stopped. Shmida and Wilson [35] 
proposed that the high number of species in transi¬ 
tional areas could be due to a process they called the 
mass effect, which is the flow of individuals from favor¬ 
able to unfavorable areas. For species that reach the 
margins of their range at the ecotone, this effect may 
result in some individuals of a given species 
establishing in ecotonal areas where they cannot main¬ 
tain viable populations, existing in sinks adjacent to 
larger source populations [35]. This may lead to 
increased species richness in ecotonal areas, which is 
maintained by constant immigration of individuals 
from more favorable environments. Some evidence 
for the existence of mass effects can be found in the 
literature, yet these effects seem to be rather weak, and 


it is currently unclear whether they can indeed act as a 
major factor generating high diversity in ecotones [21] . 

The mass effect, however, does not predict the 
occurrence of unique or endemic ecotonal species or 
genotypes. If some species or genotypes are character¬ 
istic of an ecotone or occur at the ecotone in higher 
abundances than in the neighboring habitats, as 
proposed above, this could suggest that some ecotone 
populations are ecologically viable. Several recent stud¬ 
ies have found peak genetic and morphological diver¬ 
sity within species in ecotone regions, with populations 
in these regions harboring unique and rare alleles not 
found elsewhere [17]. For example, a study examining 
allozyme (protein level) diversity within chukar par¬ 
tridge (Alectoris chukar) populations across a rainfall 
gradient from northern to southern Israel found that 
the highest levels of diversity occur in the sharp ecotone 
area between the Mediterranean region and the desert, 
in the northern Negev area. Populations in this area not 
only showed higher genetic diversity based on 32 loci 
(proteins) examined, but also had unique and rare 
alleles that did not occur elsewhere across the range 
[ 14] . The same species also had peak levels of morpho¬ 
logical diversity in the ecotone region based on 35 traits 
and 23 ratios between traits [15]. More studies on 
genetic and species uniqueness in ecotonal areas are 
needed to enable generalizations. 

Agricultural, Urban and Human-Related Ecotones 

The early literature discussing ecotones largely dealt with 
natural ecotones that are generated by environmental 
factors such as soils, geology, and climate. More recent 
research is increasingly including human-related bound¬ 
aries. Human activity is generating boundaries that did 
not exist before, changing their steepness and shifting 
ecotonal location. These boundaries include a diversity 
of ecotone types, such as forest clear-cut edges, margins 
between built-up and natural landscapes, and human¬ 
generated features, such as lakes and plantations. 
Research on these human-generated ecotones and their 
effects on biodiversity are related to a study area that is 
sometimes termed “countryside biogeography,” exam¬ 
ining biodiversity in human-dominated landscapes. 

Another recently developing research area that is 
relevant is that of urban ecology. Sharp, human-caused 
transitions may result from human activities such as 


E 




3364 


Ecotones and Ecological Gradients 



urbanization, land-use changes, agriculture, grazing, or 
burning (see Fig. 1). These boundaries occur at multi¬ 
ple spatial scales, ranging from local ecotones between 
agricultural plots, urban areas, and roads, and their 
neighboring native habitat, to large-scale ecotones 
such as shifting desert borders owing to desertification 
processes and river divergence (Fig. 1). These ecotones 
may be either static and fixed in space or dynamic and 
shifting in location over space and time. 

A substantial amount of work in both natural and 
human-related landscapes has focused on what has 
been called the “edge effect.” This is the effect of the 
juxtaposition of contrasting environments on an eco¬ 
system. It refers to how the local environment changes 
along some type of boundary, or edge and how biodi¬ 
versity is affected by such edges. This idea is attributed 
by animal ecologists to Aldo Leopold and his 1933 
book Game Management. It encompasses a wide 
range of both biotic and abiotic trends associated 
with boundaries between adjacent habitat types, natu¬ 
ral or anthropogenic. Much of the reference to edge 
effects in the recent landscape ecology literature has 
been related to human-caused boundaries, and espe¬ 
cially to boundaries between forest fragments and 
neighboring patches of habitat that have been cleared. 
Again, there are no clear-cut conclusions as to the effect 
of human-generated ecotones on biodiversity. 
The response largely depends on the type of edge and 
its history as well the species in focus. 

Its conservation implications are therefore complex 
and deserve further scientific attention. A vast amount 
of research has been done on the effect of forest edges, 
especially the effect of human-made forest fragments 
and their edges on biodiversity. A review of edges [28] 
suggested there are many discrepancies in the literature, 
and a better understanding and search for general pat¬ 
terns requires a much more mechanistic approach to 
examine the processes underlying such patterns. In this 
context, there has been much effort to understand the 
effect of forest edges on predation, brood parasitism, 
and the breeding success of birds. Paton [31] found 
that in the majority of studies, nest success varied near 
edges with an increase in both depredation and brood 
parasitism rates. The most conclusive studies suggest 
that edge effects in birds usually occur within 50 m of 
an edge [31]. Since these reviews, multiple studies on 
dozens of species and regions have been conducted. 


Implications for Biodiversity, Conservation 
Planning, and Sustainability 

Whereas better understanding of the effects of eco¬ 
tones on biodiversity is important for evolutionary 
and ecological pure science purposes, it also has 
many implications for conservation of biodiversity, 
sustainability, and practical decision-making. Some 
scientists have argued that because ecotones hold mar¬ 
ginal populations at the edge of the range of many 
species, where abundances may be lower and 
populations more prone to local extinctions than 
other parts of the range, these ecotonal areas have 
low value for conservation as they will not persist 
over time, for example, if the surrounding environ¬ 
ments become fragmented (e.g., [9]). There has been 
an interesting discussion in the literature in recent years 
regarding whether transitional areas are valuable for 
conservation (e.g., see Smith et al. [36] versus Brooks 
[4]). Substantial conservation attention has been given 
in recent years to the understanding and mapping of 
biodiversity patterns and the underlying processes, and 
toward predicting the effects of global change. Ecotone 
and boundary regions, where change, shifts, and vari¬ 
ability occur naturally in both space and time, could 
serve as useful models for understanding, monitoring, 
and predicting the response of individuals, populations, 
and communities to changing environments [2, 6, 29]. 
In addition, while some scientists suggest that ecotone 
populations are more likely to be negatively affected by 
climate change [9], other researchers have argued that 
ecotonal areas sustain populations that are adapted to 
changing, fluctuating, and unstable environments and, 
as such, these populations may better be able to persist 
in the face of predicted change (see further discussion 
in [17]). This is especially relevant since climate shifts 
are expected, according to some studies, to be rapid 
and extreme in boundary regions between ecosystems 
[2]. By examining changes in ecotone locations over 
time, these areas may potentially serve as “early warn¬ 
ing” indicators of global changes [6, 29]. However, the 
response depends on the spatial and temporal scales 
examined and may be a useful indicator mainly at 
global spatial scales and rather coarse timescales. 
Therefore, this area deserves further attention owing 
to the complexity of the factors shaping the location of 
ecotones in space and time. 
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Much research in the past decade has focused on 
prioritizing conservation efforts and determining what 
areas are most important and valuable for conserva¬ 
tion. Two main approaches have been suggested: The 
first approach includes a search for biodiversity 
hotspots, or areas with especially high species richness, 
endemism, and rarity. A second approach has been to 
select areas that are complementary, and hold biodiver¬ 
sity not present in other areas. Ecotones may provide 
a unique opportunity to conserve both high species 
richness and high complementarity. Due to their rela¬ 
tively small size, it may prove a cost-effective strategy to 
further conserve ecotone regions and to explicitly 
include them in future systematic conservation plan¬ 
ning, given they potentially provide a high return on 
investment having small area and high biodiversity. 
Undoubtedly, if these areas have the potential to main¬ 
tain and to generate species richness as well as unique 
and novel species and forms, they deserve far greater 
research attention than they are currently receiving 
[36]. Nevertheless, conservation plans for ecotones 
should not be considered independent of their sur¬ 
rounding environments. Ecotonal regions are impor¬ 
tant for our understanding of evolutionary (such as 
speciation, divergence with gene flow, and adaptation) 
and ecological processes shaping biodiversity (e.g., 
response of populations to fluctuating environments). 
They may enable us to better predict the responses of 
populations to environmental change and to further 
identify previously neglected biodiversity hotspots 
valuable for biodiversity research and conservation. 

Future Directions 

Research on the effect of ecotones on biodiversity is an 
important future direction in the face of global change, 
including land-use and climate change. Because eco¬ 
tones are areas where shifts in environmental condi¬ 
tions occur in space and time, they may serve as useful 
indicators of environmental change and of the response 
of ecosystems to shifts in climate. The degree of effec¬ 
tiveness of ecotones as early predictors of the impacts of 
change and the ways that ecological communities and 
systems respond to change may be an important future 
direction. Ecotones can sustain unique forms or species 
that are less abundant or do not occur elsewhere. In 
addition, studies suggest that ecotones are areas where 


some populations are diverging to new species in the 
face of gene flow (across the ecotone). It is important to 
examine the generality of these patterns and processes 
in different regions, spatial and temporal scales, and 
groups. 

Another future direction is the application of 
multi-disciplinary approaches from other fields of sci¬ 
ence in studying and quantifying ecotones. For exam¬ 
ple, much work has been done in physics on transitions 
between states (liquid-solid etc.), which could be 
applied to work on environmental transitions ecotones. 
More work on the interactions between native and alien 
species and the role of ecotones in these interactions 
should be done to examine the hypothesis that eco¬ 
tones are barriers to invasion between communities. 
The importance of ecotones to the generation and 
conservation of biodiversity, especially in the face of 
global climate and other environmental changes is an 
area of potential research and conservation investment 
that should further be explored and studied. The role of 
ecotones in promoting sustainability and its emerging 
science is an open area, which should be further 
explored in future work. 
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Electric coupling The method by which two electric 
powers are added together. 

Electric vehicle An electric vehicle is referred to as a 
vehicle that uses electric energy as its energy source, 
generally chemical batteries and/or ultracapacitors 
and electric motor as its power source. 

Fuel cell vehicle A fuel cell vehicle is referred to as 
a vehicle that uses fuel cells and electric motor as its 
power source. 

Hybrid electric vehicle A hybrid electric vehicle is 
referred to as a vehicle in which two energy sources 
and correspondingly two power sources are 
installed, which are selectively used to power the 
vehicle. The primary energy source and power 
source may be chemical fuels and internal combus¬ 
tion engine. The secondary energy source and 
power source may be chemical batteries and electric 
motor. 

Mechanical coupling The method by which two 
mechanical powers are added together. 

Operation mode Drive train operation status in which 
power sources are selectively used as the active 
power mover to propel the vehicle. 
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Glossary 

Architecture The mechanical and/or electrical 
structure that defines the energy flow routes in a 
vehicle. 


The architecture design of an electric vehicle is to 
design its energy flow route from the energy source to 
the energy destination. Usually, chemical batteries are 
used as its energy source and electric motors are used to 
produce mechanical power to propel the vehicle. For 
satisfying vehicle performance, the batteries, electric 
motor, and transmission should be properly designed. 

A hybrid electric vehicle uses two power sources to 
power the vehicle. There are many methods to connect 
the two power sources together, each of which has its 
special operation characteristics. For different vehicles 
that have different mission requirements and operation 
environments, a special architecture should be used to 
fully use its advantages and avoid its shortcomings. 

Fuel cell vehicle is the vehicle which is powered by 
a fuel cell system. Compared with internal combustion 
engine, fuel cell is more efficient and cleaner. Architec¬ 
tural design of a fuel cell vehicle consists in designing 
the drive train structure and components to meet the 
vehicle performance requirements. 
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Introduction 

Electric vehicle (EV) is referred to as a vehicle that 
employs electric energy storage as its energy source 
and electric machines as its power source. Electric 
energy is a multisource energy type which can be 
obtained from many primary energy sources, such as 
traditional fossil energy sources (coal, petroleum, and 
natural gas), nuclear energy, hydraulic power, 
bio energy, solar energy, wind energy, etc. With contin¬ 
uous depletion of petroleum resources and more and 
more concerns about environmental issues, it has been 
well recognized that electric vehicle is one the most 
viable substitutes to the current petroleum-fueled vehi¬ 
cle. Electric vehicle also possesses other advantages of 
absence emissions, high efficiency, quiet and smooth 
operation, etc. With continuous progress on the tech¬ 
nologies of chemical batteries, electric propulsion sys¬ 
tems, and electronic control, electric vehicles are much 
closer to meeting a user’s requirements than ever 
before. 

Compared with petroleum fuels, the energy density 
of chemical batteries is much lower, which results in 
shorter driving range per battery charge. Long time 
battery charging also causes incontinence to users. At 
present, the battery-powered electric vehicles cannot 
challenge the petroleum-powered vehicles in terms of 
performance and use convenience. Hybrid electric 
vehicles are the combination of conventional vehicles 
and battery-powered electric vehicles. They can adopt 
the advantages and avoid the shortcomings of both. 
It has been recognized that hybrid electric vehicles, 
especially plug-in hybrid vehicles, are the major 


substitutes of conventional vehicles before technologies 
of chemical batteries have substantive progress in 
energy density, safety, and cost. 

Fuel cell vehicles have been considered to be the 
final solution as the future high-efficiency and clean 
vehicles. Compared with conventional vehicles, fuel cell 
vehicles are independent from the petroleum resource 
and potential has high efficiency and zero emission. 
Compared with battery-powered electric vehicles, 
they potentially have the advantages of long driving 
range and quick fuel replenishing. However, at present, 
the hydrogen sources and onboard storage are the 
major challenges. 

Architectures of Electric Vehicles 

In principle, electric vehicles share the common char¬ 
acteristics of internal combustion engine-powered 
vehicles. A chemical battery pack replaces the fuel 
tank as its energy storage. Electric machine replaces 
the internal combustion engine as the power source. 
Both share the similar performance requirements, such 
as, speed, acceleration, gradeability, and driving range. 
The design principles as discussed in the chapter of 
► Vehicle Dynamics and Performance are still valid 
for electric vehicles. However, unique operating char¬ 
acteristics of the electric propulsion system have to be 
involved into the design process. 

Previously, electric vehicles were designed to have 
a similar drive train structure to conventional internal 
combustion engine-powered vehicles as shown in 
Fig. 1. It simply replaces the engine with an electric 
motor and fuel tank with a chemical battery pack. With 
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development of the technologies of light weight, com¬ 
pact electric drive system, and advanced control, an 
electric vehicle drive train can be designed to various 
configurations as shown in Fig. 2 [ 1 ] 

Figure 2a shows the first alternative configuration 
that is most alike a conventional one. An electric 
motor (EM) drive replaces the internal combustion 
engine. A clutch is used to connect the motor drive to 
or disconnect from the final drive and differential. 
A transmission gear box (GB) is used to modify the 
output speed-torque profile for meeting the perfor¬ 
mance requirement and supply a reverse gear. This 
drive train may be used for the motor drives which 
are only capable of operating in one or two quadrants, 
forward motoring and/or forward regeneration. Since 
a transmission gearbox is used, the torque-speed pro¬ 
file may not be required to close the ideal profile, 
which has constant torque in low speed range and 
constant power in high speed range. A motor drive 


with simple control may meet the requirement, such 
as series-excited DC motor. 

Similar to the configuration shown in Fig. 2a, b 
shows the configuration in which clutch is not further 
needed. The motor starts producing its torque 
from zero speed. The transmission gear box may be 
either multi-gear or single gear, depending on the 
motor torque-speed profile. With a multi-gear trans¬ 
mission, the motor torque needs to be stopped during 
shifting. This can be carried out by cutting the current 
supply to the motor drive by the motor controller. 

Figure 2c illustrates a configuration with a more 
compact design. A fixed gear ratio single gear transmis¬ 
sion (FGB) is installed between the motor drive and the 
final drive (differential). This design requires the motor 
drive to be capable of producing a torque-speed profile 
which is closer to the ideal one. It also requires the 
motor be capable of operating in four quadrants, that 
is, forward and backward motoring and generating. 
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Possible electric vehicle configurations [1] 
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Figure 2d illustrates a configuration in which two 
motor drives are used. Each motor drives an individual 
wheel through a fixed gear ratio single gear box (FGB) 
which is used to match the motor speed to the wheel 
speed. A differential function is carried out by control¬ 
ling the two motors. One advantage of this configura¬ 
tion is that it can fully utilize the road adhesive 
capability to improve the drivability of the vehicle. 
For example, when one wheel is on a slip road and 
loses its drive force, another wheel can produce a big 
driving force to help the vehicle get out of trouble. 

The drive train shown in Fig. 2e is very similar to 
the one in Fig. 2d. The difference is that the two fixed 
gear ratio single gear boxes (FGB) are attached to 
wheels rather than to the vehicle frame. This design 
can save much space and may make packaging easier. 

Figure 2f shows a simplest drive train structure, in 
which the wheel is designed as the rotor of the motor. 
This configuration is the most compact one and saves 
a lot space. This structure is generally termed as 


wheel motor. In the design of the drive trains shown 
in Fig. 2e, f, the impact of additional wheel weight to 
vehicle handing and vibration performance should be 
considered. 

In all the architectures, the electric motors are 
powered by a battery pack and corresponding motor 
controller(s). 

Except electric motor(s) and corresponding trans¬ 
missions, a modern electric vehicle also needs other 
components for fulfilling its functions as shown in 
Fig. 3. The drive train consists of three major subsys¬ 
tems: electric propulsion subsystem, energy storage 
subsystem, and auxiliary subsystem. These three sub¬ 
systems are interfaced through power routes and con¬ 
trol signal routes. The electric energy from power grid 
is charged to the energy storage (batteries) through an 
energy charge system (battery charger). The energy 
storage delivers its energy to or absorbs energy 
from the electric motor through an electronic power 
converter which functions as a motor controller. 
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Conceptual illustration of an electric vehicle configuration [1] 
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The electric motor operation is controlled by a vehicle 
controller which commands the torque requirement to 
the motor controller based on the accelerator or brake 
pedal position and energy source operation parameters 
and energy management algorithm. The energy source 
also supplies energy to the vehicle auxiliaries. A battery 
management system (BMS) is used to monitor the 
operation of the battery pack, generate operation infor¬ 
mation, such as state-of-charge (SOC), and protect 
battery pack from abusing. 

Architectures of Hybrid Vehicles 

Hybrid vehicles (HV) are referred to as the vehicles that 
are powered by at least two energy sources and two 
power sources, in which at least one energy source and 
one power source have bidirectional energy flow capa¬ 
bility, delivering energy to drive wheels and accepting 
energy from braking. When electric energy source and 
electric power source are used, they are referred to as 
hybrid electric vehicles (HEV). It is possible to replace 
the electric system with a hydraulic system. This kind of 
hybrid vehicle is referred to as hydraulic hybrid vehicles 
(HHV). Fuel cell vehicles (FCV) are usually referred to 
the vehicles in which fuel cells are used as the primacy 
power source. A fuel cell vehicle may use two energy 
sources, fuel cells and chemical batteries, which is 
referred to as fuel cell hybrid vehicles (FCHV). 

Since there are two power sources and only one 
energy sink (wheels), there are many ways to combine 
the two powers together. The design of the energy flow 
routes are referred to as drive train architecture design. 
Control of the energy flows in hybrid drive train is 
referred to as vehicle control strategy. 


Concept of Hybrid Vehicle 

In a hybrid vehicle, two power sources supply powers to 
drive wheels. The total power should meet the vehicle 
performance requirements, in terms of maximum 
speed, gradeability, acceleration time requirement, 
and driving range. The vehicle performance requires 
the vehicle possessing sufficient power, and sufficient 
energy for continuous driving range. In operation, any 
one of the two power sources or both can be used for 
traction. Thus, there are many possible operation 
modes as shown in Fig. 4 [2]. 

The primary energy source and energy converter 
allows unidirectional energy flow, whereas, the 
secondary energy source and energy converter are 
required to have the capability of bidirectional energy 
flow. With today’s technologies, the common primary 
energy sources are chemical fuels and primary energy 
converters are internal combustion engines (ICE) 
and fuel cells. The secondary energy sources are 
chemical batteries and/or ultracapacitors and the 
secondary energy converters are electric motors. The 
final destination of the energy flow (energy sink) 
is the vehicle wheels with the form of mechanical 
energy. 

As shown in Fig. 4, the wheels can be powered by 
selecting power sources, which are referred to as oper¬ 
ation modes, which include: 

1. Primary power source alone propelling 

2. Secondary power source alone propelling 

3. Both power sources propelling (hybrid drive) 

4. Regenerative braking (secondary power source 
absorbs energy form load) 


E 


Unidirectional 
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Concept of hybrid electric vehicles 
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5. Primary power source splitting for propelling and 

secondary power source charging 

So many available operation modes create higher 
flexibility over single power source vehicles. It provides 
the drive train with opportunities for maximizing its 
operation efficiency and/or minimizing its emissions at 
any driving conditions by selectively using a best oper¬ 
ation mode. 

Vehicle load in normal driving varies randomly as 
shown in Fig. 5. Conceptually, the random load is 
composed of two components, one is the steady 
power (average) and the other is the dynamic power 
with zero average [2]. Ideally, hybrid vehicle can be 
designed and controlled in such a way that one power 
source supplies a constant power for the average load, 
which is referred to as energy source, since all the energy 
comes from it, and the other power source supplies 
dynamic power, which is named peaking power source, 
since it consumes zero energy in a whole driving cycle. 

In normal driving, average power is relative small 
(few 10 kW). Thus, the primary power source can be 
designed with a small power capacity. When the inter¬ 
nal combustion engine is used as a primary power 
source, it can be downsized and operated near its 
most efficient region. In addition, other types of 
engines may be used, such as Sterling engine, gas tur¬ 
bine engine, and others. The secondary power source 
may be batteries, ultracapacitors, flywheels and their 
combination, and electric motors. 

Architectures of Hybrid Vehicles 

Architecture of a hybrid vehicle is loosely defined as the 
ways by which the components of a hybrid drive train 
are connected and, therefore, the energy flow routes are 
schemed. Traditionally, hybrid vehicles are classified 
into series, parallel and series/parallel. However, the 
definitions of these configurations have not been clear. 


As discussed in a prior section, a hybrid vehicle has two 
power sources and one energy sink. Thus, there has to 
exist at least point at which two powers are merged 
together. At this point, the power forms have to be 
identical, mechanical or electrical. In this chapter, the 
terms of electric coupling and mechanical coupling are 
used rather than the traditional names. Figure 6 dem¬ 
onstrates the concepts of four configurations. 

Figure 6a shows the configuration of an electric 
coupling, traditionally, a series configuration. Two 
powers are merged together at the power converter in 
the form of electrical. An internal combustion engine 
drives an electrical generator to supply electric power 
to the electronic converter, and at the same time, the 
battery pack also contributes its power to the power 
converter. The output power from the power converter 
flows to the electric motor to produce mechanical 
power and then drives vehicle wheels. The power shares 
are controlled by the power converter. 

Figure 6b shows the configuration of mechanical 
coupling, traditionally, a parallel configuration. The 
internal combustion engine directly contributes its 
mechanical power to a mechanical coupler. At the same 
time, the battery pack supplies its power to the electric 
motor through a power converter which functions as the 
motor controller for controlling the motor torque. Two 
mechanical powers are merged together at the mechan¬ 
ical coupling and then delivered to drive wheels. 

Figure 6c shows the configuration in which three 
power couplers are used, two mechanical and one elec¬ 
trical. There are plentiful available operation modes in 
the configuration that will be discussed in detail in 
following sections. 

Figure 6d shows a configuration which is similar to 
Fig. 6c. The only difference is that the electric coupling 
function is moved from the power converter to the 
batteries and one more power converter is added 
between the motor/generator and the batteries [2]. 



Steady power and dynamic power [2] 
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Electrically coupling (Series) 



Mechanical coupling (Parallel) 



Mechanical and electric coupling 
(Series-parallel) 


Mechanical and electric coupling (Complex) 
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Various configurations of hybrid vehicles [1] 


Electric Coupling HEV (Series) 

The electric coupling configuration (series) is schemat¬ 
ically shown in Fig. 7, on which, the possible power 
flow routes are also shown. This drive train may have 
been developed originally from pure electric vehicles by 
adding a “range extender” engine/generator set. The 
engine/generator unit is used to contribute part of the 
electric energy for compensation of battery energy 
shortage. An electric power converter is used for energy 
management purpose based on the states of charge 
(SOC) of the battery pack. As shown in Fig. 7, there 
are several operation modes available: 

1. Pure electric propelling mode, in which the engine/ 
generator is shut down and only batteries are used 
to propel the vehicle. 

2. Pure engine/generator traction, in which batteries 
are isolated from the power supply route and 
engine/generator power follows the load power. 
This operation mode is also used by diesel-electric 
locomotives 


3. Hybrid propelling mode, in which both batteries 
and engine/generator supply power to the traction 
motor. The power shares between batteries and the 
engine/generator are controlled by coordinate con¬ 
trols of the power electronic converter and engine/ 
generator. 

4. Engine/generator power split mode, in which the 
power of engine/generator is separated by the 
power converter, driving wheels and charging 
batteries. 

5. Regenerative braking mode, in which the electric 
motor is operated to function as electric braking to 
slow down the vehicle and get the braking energy 
back to the batteries. The engine/generator may be 
shut down meanwhile. 

6. Battery charging mode, in which the engine/gener- 
ator is operated only for charging batteries while 
vehicle stop. 

7. Hybrid battery charging mode, in which both the 
engine/generator and the regenerative braking 
charge the batteries. 
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Electric coupling (series) hybrid drive train configuration and power flow scenario 
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Directly connected power sources and motor drive 


Electric Coupling Devices 

The electric coupling device is the sole device at which 
two electrical powers are combined together. The oper¬ 
ation modes discussed above can be carried out by 
coordinately controlling the electric coupling device 
and engine/generator. The simplest structure may be 
to directly connect the three terminals together as 
shown in Fig. 8. 

The configuration in Fig. 8 has the features of 
identical voltage at the coupling point. The DC bus 
voltage is equal to the battery voltage. The power flows 
are only controlled by the voltage of the generator/ 
rectifier, which depends on the operation speed of the 
generator and magnetic field. For battery charging, the 
engine/generator voltage has to be higher than the open 


circuit voltage of the batteries. In regenerative braking, 
the bus voltage has to be higher than the battery volt¬ 
age. However, the voltage generated by the motor in 
generating mode is proportional to the motor speed. 
Thus, the regenerative braking capability at low speeds 
may be rather limited. 

A DC/DC converter can be added between the 
batteries and the DC bus to release the voltage con¬ 
straint as shown Fig. 9. This design may significantly 
improve the performance of the drive train. It has 
the following advantages: (1) variations in the battery 
voltage do not affect the bus voltage, thus the 
engine/generator can be operated at a constant speed; 
(2) the battery energy can be fully used, which is very 
important for a plug-in hybrid drive train; (3) low 
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Configuration with a DC/DC converter 


battery voltage and less battery cells can be used, which 
may lead to a small and light battery pack, low cost and 
high reliability; and (4) charging current of the batte¬ 
ries can be regulated during regenerative braking and 
charging from the engine/generator. 

It is obvious that the DC/DC converter has to be 
bidirectional. It is considered to be the appropriate 
design with battery voltage lower than DC bus voltage. 
It boosts the battery voltage to match the DC bus 
voltage for propelling. On the other hand, it bucks 
the DC bus voltage to the level of the battery voltage 
for battery charging. In the situation of regenerative 
braking with the generated voltage by the motor higher 
than the battery voltage, the DC/DC operates in buck 
mode. However, at low speeds, the DC bus may be 
lower than the battery voltage. In this case, the DC/ 
DC converter has to be operated in boost mode. The 
DC/DC operation mode is illustrated in Table 1 [2]. 

Figure 10 shows the topology of the four-quadrant 
DC/DC converter, operating in forward buck/boost 
and backward buck/boost by turning on or off the 
switches S l5 S 2 , S 3 , and S 4 . The operation of the DC/ 
DC converter is explained below. 

Forward Bucking Forward operation is defined as 
the energy flow direction from batteries to DC bus. 
Forward bucking operation is that the DC/DC con¬ 
verter supplies current to the DC bus which has 
a voltage lower than battery voltage. In this operation, 


Electric, Hybrid Electric and Fuel Cell Vehicles, Architec¬ 
tures of. Table 1 Basic function a DC/DC converter [2] 



Energy flow 


Battery 

discharging 

Battery 

charging 

Battery alone traction 
mode 

Boost 

- 

Battery charging from 
engine/generator 

- 

Buck 

Regenerative braking 

- 

Buck/boost 


switches S 2 , S 3 , and S 4 are turned off as shown in 
Fig. 11. Switch Si is operated on and off periodically. 
During the on period of Si, the batteries supply current 
to the DC bus through S 1? L d , and D 3 as shown in 
Fig. 11a, and during the off period, the batteries are 
isolated from the DC bus, and the current continuously 
flows through D 2 , L d , and D 3 . The bus voltage depends 
on the duty cycle <5, of S 1? which is defined as the 
ratio of the on period, t on , to whole period 
T(T = t G n T t G ff). When <5 = 0, DC bus voltage is 
zero, and when <5 = 1, DC bus voltage equals to the 
battery voltage: 

V hw = 8V h . (1) 

The pulse width modulation (PWM) method is 
used to control the on and off operation of S : to obtain 





































3376 


E 


Electric, Hybrid Electric and Fuel Cell Vehicles, Architectures of 


Rectifier 


T 



Electric, Hybrid Electric and Fuel Cell Vehicles, Architectures of. Figure 10 

Bidirectional DC/DC converter [2] 
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Forward bucking operation 


the desired DC bus voltage. This case may not be used 
with a design that the battery voltage is lower than the 
DC bus voltage. 

Forward Boosting When battery voltage is lower 
than the DC bus voltage, the DC/DC converter is oper¬ 
ated for boosting the battery voltage for supplying 
current to the DC bus. In this case, switch S x is on, S 2 
and S 3 are off, and S 4 is operated on and off periodically 
as shown in Fig. 12. During the on period of S 4 , 


batteries charge the inductor L d through Si and S 4 as 
shown in Fig. 12a. During the off period of S 4 , both 
batteries and the inductor L d supply current to the DC 
bus, which has a higher voltage than batteries. The DC 
bus voltage depends on the duty cycle of S 4 : 

^bus = ylF- (2) 

Backward Bucking Backward operation means the 
DC bus is supplying current to batteries from 
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Forward boosting operation 
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Backward bucking operation 


regenerative braking or engine/generator. Bucking 
operation is to reduce the DC bus voltage to battery 
voltage level. In this operation, S 1? S 2 , and S 4 are off and 
S 3 is operated on and off periodically. During the on 
period of S 3 , the DC bus supplies current to batteries 
through S 3 and D : as shown in Fig. 13a. During the off 
period of S 3 , the DC bus is isolated from the batteries. 
The batteries and DC bus have a voltage relation of 

V b = 8 F bus . (3) 

This mode may be used in the situation of charging 
the batteries from engine/generator and regenerative 
braking at high vehicle speeds. 

Backward Boosting This operation is to boost a low 
DC bus voltage to a high battery voltage. It may be used 
in regenerative braking at low vehicle speeds at which, 
the voltage generated by the motor is low. In this 
operation mode, S : and S 4 are off and S 3 is on and S 2 
is operated on and off periodically. During the on 
period of S 2 , the DC bus charges the inductor L d 
through S 3 and S 2 as shown in Fig. 14a. During the 


off period of S 2 , both the DC bus and inductor L d 
supply energy to the batteries. The DC bus and batte¬ 
ries have the voltage relation: 

Vb = ^. (4) 

Advantages and Disadvantages of Electric Coupling 
(Series) Drive Train 

Electric coupling (series) hybrid vehicles have the 
advantages of: 

1. Internal combustion engine is completely mechan¬ 
ically decoupled from wheels. Thus the engine oper¬ 
ation can be operated in its most efficient speeds 
range. On the other hand, since batteries supply 
supplementary power, engine power is also 
decoupled from load power. Therefore, the engine 
can be operated in its most efficiency power range 
as shown in Fig. 15. 

2. Since the engine can be potentially operated in 
a very narrow speed and power region, it becomes 
much easier for further improvement of fuel 
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Backward boosting operation 
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Illustration of a typical engine fuel consumption map 


economy and emission characteristics by using 
some engine special design and control technolo¬ 
gies in this narrow region than in the whole oper¬ 
ation domain. 

3. Completely mechanical decoupling of engine from 
driving wheels allows using engines that may not be 
suitable for direct vehicle propelling, such as gas 
turbine engine, sterling engine, etc. 


4. A multi-gear transmission and torque converter 
may not be required, since the vehicle is solely 
driven by an electric motor, which has proper 
torque-speed profile for vehicle traction. 

5. In-wheel motors may be used similar to a pure 
electric vehicle. 

6. The control of the drive train may be simple, com¬ 
pared with other configurations. 
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However, electric coupling (series) drive train is 
also suffered from some disadvantages such as: 


Inside the device, two torques, T x and T 2 , are added 
together in the following way: 


1. Two times energy form transformations from 
engine to wheels (mechanical to electrical and 
then to mechanical again) may cause significant 
energy losses. 

2. Additional electric machine (generator) adds 
weight and cost. 

3. Traction motor(s) have to be designed big enough 
to meet vehicle performance, such as maximum 
speed, acceleration, and gradeability. 

Mechanical Coupling Hybrid Drive Trains 

In mechanical coupling hybrid vehicles, mechanical 
coupling devices are used to couple two mechanical 
powers together. Mechanical power composes of force 
and speed in linear motion, and torque and angular 
speed in rotary motion. Accordingly, a mechanical cou¬ 
pling device may add to torques together or add two 
angular speeds together, depending on its mechanical 
structure. 


r 3 = h 7i + k 2 T 2 , (6) 

where /q and k 2 are structural parameters which are 
constants or variables, depending on the structure design 
of the device. T x and T 2 are independent of each other 
and then can be independently controlled. Solving 
Eqs. 5 and 6 gives the relationship among the speeds: 


Obviously, three speeds are linked together by /q 
and k 2 , and have a fixed relationship with each other. 
When a torque coupler is used in a hybrid vehicle, the 
engine torque can be decoupled from the wheel torque 
and motor torque is used for compensation. However, 
the engine speed varies with the vehicle speed. This 
feature provides the engine with an opportunity of 
always operating at its operation line at any vehicle 
load as shown in Fig. 17. Figure 18 shows the com¬ 
monly used torque couplers. 


E 


Torque Couplers A torque coupler is a three-port 
mechanical device which adds two input torques 
together and delivers them to output port as concep¬ 
tually illustrated in Fig. 16. Port 1 is a unidirectional 
input port (e.g., engine port), port 2 is bidirectional 
input port (e.g., electric motor port), and port 3 is 
a bidirectional output port (e.g., wheel port). 

Ignoring losses and inertia effect, the total input 
power is always equal to the output power. Supposing 
port 1 and port 2 are inputs and port 3 is output, one 
obtains: 

T 3 cj 3 = Ti coi + T 2 co 2 . (5) 


Speed Couplers Similar to the torque coupler (also 
refer to Fig. 16), a mechanical speed coupler can be 
described as: 

co 3 = ki coi + k 2 oj 2 (8) 

where /q and k 2 are the constants or variables associated 
with the device structure design. Among these three 
speeds, two of them are independent and then can be 
controlled independently. Combining with Eq. 5, the 
torques are linked together by: 

r 3 = F = Ik (9) 


Port 1 

Mechanical 

torque 

coupling 


r i,®i r 

' Port 3 ’ 


Port 2 

T> ^ CO^ 
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Concept of a mechanical coupling device 


A typical speed coupler is the planetary gear unit 
as shown in Fig. 19. It consists of a sun gear 1, a ring 
gear 2, and a yoke 3 which is hinged to the centers 
of several planetary gears (usually three or four gears 
for force balance). The relationship among co 1} co 2 , and 
co 3 is 


co 3 


1 

1 + gr 


CO l + 


gr 

1 + gr 


CO 2 


( 10 ) 


where g r is the gear ratio of ring gear to sun gear 
(R 2 /Ri = z 2 /sq, where z 2 and z x are the tooth number 
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Engine speed, rpm 
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Engine fuel consumption map and its optimal operation line 


of ring gear 2 and sun gear 1). Obviously, constants 
ki and k 2 in Eqs. 8 and 9 are k x = 1/(1 +g r ) and 
k 2 = £>/(!+£>)• The directions of the torques as shown 
in Fig. 19 (counterclockwise for sun and ring gears and 
clockwise for yoke) are defined as positive. The torque 
relationship can be expressed by Eq. 9. It can be seen 
from Fig. 19 that the torques on sun gear and ring gear 
always have the same direction and the torque on the 
yoke always has an opposite direction. 

Another interesting speed coupling device is an 
electric motor with a floating stator [2], in which the 
stator, generally fixed to a stationary frame in conven¬ 
tional motor, is released and forms a double rotor 
motor as shown in Fig. 20. In this device, port 2 is an 
electric port, at which electric power is converted into 
mechanical power at the air gap between the inner and 
outer rotor to generate a relative speed co 2 . Obviously, 

CO 3 = OJ\ T OJ 2 , (11) 

and 

T 3 = 7i = T 2 . (12) 

Here, k\ = 1 and k 2 = 1 in Eqs. 8 and 9. 

With a speed coupler, the engine speed is decoupled 
from the vehicle speed, thus the engine can always be 
operated within its optimal speed range. 

Selective uses of electric coupling, mechanical 
torque, and speed couplings can create many hybrid 
vehicle configurations as discussed in following 
sections. 


Hybrid Drive Train with One Torque Coupler 

Hybrid drive train with one torque coupler is the sim¬ 
plest configuration which is traditionally referred to as 
parallel configuration. The torque coupler can be 
designed with different ki and k 2 values as indicated 
in Eqs. 6 and 7 and Fig. 16. A two-shaft configuration 
with gear box as its coupler and two transmissions is 
shown in Fig. 21. 

In this configuration, the torque coupler can be 
designed with any parameter values of k x and k 2 . In 
front of the torque coupler, two transmissions can be 
used, which may be multi-gear, single gear, or contin¬ 
uous varying transmission (CVT), depending on the 
engine and motor torque-speed characteristics and 
vehicle performance requirements. Actually, two trans¬ 
missions and the torque coupler together can be taken 
as one assembly, which has the inputs of engine and 
motor and the output to final drive. The torque cou¬ 
pling assembly still has the same characteristic of the 
single torque coupler explained by Eqs. 6 and 7, except 
gear ratios of two transmissions being added, which 
can be expressed as: 

T 0 = k\ i e T e + k 2 i m T m . (13) 

and 


where co e , T e) co mi and T m are the angular speeds and 
torques of the engine and motor, i e and i m are the 






Electric, Hybrid Electric and Fuel Cell Vehicles, Architectures of 


E 


3381 


Gear Box 
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Commonly used torque coupling devices [2] 


transmission gear ratios of the engine and motor trans¬ 
missions, respectively. 

Obviously, two multi-gear transmissions produce 
many tractive effort profiles [2]. For example, two 
three-gear transmissions produce nine tractive effort 
profiles (3 x 3). High vehicle acceleration and 


gradeability performance can be obtained. Further, 
high overall efficiency of the drive train can also be 
obtained by selecting proper gears for operating the 
engine and motor within their best region. This design 
also provides opportunity for using the engine and 
motor which may have less favorable characteristics. 
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Planetary gear unit used as a speed coupler [2] 


Port 2 
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Electric motor with double rotors as a speed coupler [2] 

However, this design makes it complicated for the 
control system for selecting proper gears. 

Instead, a single gear transmission for the motor 
may be used, since electric motor generally has good 
torque-speed characteristics. The three-gear transmis¬ 
sion for engine is used to modify the engine torque- 
speed profile and provides opportunity for engine 
operating closer to its high-efficiency region. 

Another possible design is to use single gear trans¬ 
mission for engine and multi-gear transmission for 
engine. This design is considered to be an unfavorable 
design because it does not take advantage of the two 
power plants. 

The simplest design is to use two single gear trans¬ 
missions. When the engine and motor are designed to 
meet gradeability and acceleration performance, it is 
considered to be a good design. 


Another two-shaft torque coupling hybrid config¬ 
uration is to place a transmission behind the torque 
coupler as shown in Fig. 22 [ 1 ], in which, the engine 
and motor torque-speed profiles and the tractive effort 
are also illustrated. Obviously, the transmission mod¬ 
ifies the engine and motor torques with the same scale. 
The gear ratio of the torque coupler (/q and k 2 ) is 
designed to matches the speeds of the engine and 
motor. This design may be a proper selection for the 
one with larger engine and small motor, such as mild 
hybrid drive train. 

A simple shaft may be used as the torque coupler, 
which constitutes the configuration, traditionally 
called one-shaft parallel drive train. A transmission 
may be placed in front of the torque coupler as shown 
in Fig. 23. In this configuration, the rotor of the electric 
motor functions as the torque coupler, in which one 
end is connected to the engine and the other end to the 
transmission. Obviously, here, k x - 1 and k 2 - 1. This 
configuration has the similar operation characteristics. 
Since the motor directly drives the final drive, the 
motor speed is proportional to the vehicle speed, and 
the final drive gear ratio should be designed to match 
the motor speed and vehicle speed. 

An alternative configuration is to place the trans¬ 
mission behind the motor as shown in Fig. 24. In 
this configuration, torques of both engine and 
motor are enhanced by the transmission. However, 
the engine and motor are directly coupled together; 
hence, they have to have the same operation 
speed range. The motor can serve as the engine start 
when a clutch is placed between the motor and trans¬ 
mission. When this configuration is used in a mild 
hybrid drive train, where motor alone operation 
mode is used, the clutch can be placed between the 
motor and transmission. The motor functions as an 
engine torque booster, engine start, generator, and 
regenerative braking. 

Actually, the vehicle chassis and road can also used 
as a torque coupler. Figure 25 shows a hybrid drive 
train in which the engine and motor drive the front 
and rear wheels, respectively. The tractive efforts pro¬ 
duced by the engine and motor are added together by 
the vehicle chassis and road. This configuration offers 
some advantages of keeping the original engine and 
transmission driveline and providing the vehicle with 
a four-wheel-drive capability. 
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General torque coupling hybrid drive train [2] 


T e 
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Two-shaft configuration with a transmission behind the torque coupler [2] 


Hybrid Drive Trains with Torque and Speed 
Couplings 

As discussed in prior sections, the torque coupler can 
decouple the engine torque from the wheel’s torque 
demand, and speed coupler can decouple the engine 
speed from the wheel speed. Thus, it is possible to 


decouple the engine torque and speed from the wheels 
with two couplers. This concept is illustrated 
in Fig. 26. 

A detailed drive train structure is shown in Fig. 27 
[1]. The drive train uses a planetary gear unit as its 
speed coupler. Engine is connected to its yoke and 
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One-shaft configuration with a transmission in front of the torque coupler [1] 
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One-shaft configuration with a transmission behind the torque coupler [2] 


a motor/generator to its sun gear. The ring gear as 
the output port is connected to a torque coupler 
which is a fixed shaft gear set. Another two ports of 
the torque coupler are connected to a traction motor 
and wheels. 

As indicated in Eqs. 8 and 9, Fig. 19, the torque of 
the motor/generator always has an opposite direction as 
that of the engine and can rotate in two directions. When 
the rotating speed is in the same direction as its torque, it 
is a motor, otherwise, a generator. When it operates in 
the generating mode, it absorbs part of the engine power. 


Due to this reason, this hybrid drive train gets another 
name of engine power split hybrid drive train. 

The torque of the motor/generator is always pro¬ 
portional to the engine torque. Thus, an ideal opera¬ 
tion characteristic is a constant torque at the whole 
positive and negative speed domain. Also, the three 
controllable components (the engine, motor/generator, 
and traction motor) provide very plentiful operation 
modes, and hence, greater opportunity for high oper¬ 
ating efficiency. However, very complex control is 
required [2]. 
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Front- and rear-wheel-driven hybrid vehicle 
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tures of. Figure 26 

Concept of torque and speed coupling drive train 

Architectures of Fuel Cell Vehicles 

Fuel Cell System 

A fuel cell is a chemical device that converts the chemical 
energy into electric energy by means of redox reaction 
with the operation principle exactly the same as in 
a chemical battery. In a battery cell, the reactive chemicals 
are pro-installed in the cell. When the active chemicals are 
used up, the cell is dead and has to be recovered by 
charging it from an outside electrical source. Instead, 
a fuel cell is fed with its reactive chemicals continuously 
from outside of the cell. It is more like an internal 
combustion engine which is fed with fuel and produces 
mechanical energy. However, the energy conversion effi¬ 
ciency in a fuel cell is higher than internal combustion 
engine due to its free from the Carnot cycle limitation. 


Compared to chemical batteries, it has the advantages 
of quick refueling and much more energy in an outside 
energy storage, similar to a fuel tank. 

A fuel cell has a very similar structure of chemical 
battery cell, as shown in Fig. 28. Fuel and the oxidizing 
agency are continuously and separately fed into the 
anode and cathode electrodes, where they are ionized. 
Electrolyte is used to conduct positive ions from the 
anode to the cathode, and at the same time, electrons 
are conducted from the anode to the cathode through 
an electrical load. 

A single fuel cell has very low voltage. In practice, 
a fuel cell stack may include hundreds of single cells 
serially connected together. A fuel stack needs auxil¬ 
iaries to support its operation. The auxiliaries mainly 
include air circulating pump, coolant circulating 
pump, ventilation fan, fuel supply pump, and electric 
control devices. Part of the power developed from the 
fuel cell stack is used to support the operation of the 
auxiliaries as shown in Figs. 29 and 30. 

Fuel Cell Technologies 

There are several types of fuel cells, depending on their 
electrolytes. Table 2 lists the operation temperature and 
electrolytes. 

Proton Exchange Membrane Fuel Cells (PEMFCs) 

PEMFCs work with hydrogen as fuel and oxygen as 
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Hybrid drive train with speed and torque couplers 



Electric, Hybrid Electric and Fuel Cell Vehicles, Architectures of. Figure 28 

Illustration of fuel cell operation 


oxidant. Hydrogen is supplied to the anode where it is 
oxidized, losing electrodes in the process. Positive 
hydrogen ions (protons) migrate across the electrolyte 
through the membrane to the cathode, at the same 


time, the electrons travel round the external circuit to 
the cathode. The oxygen is supplied to the cathode 
where it is reduced, picking up the electrons and the 
ions from the hydrogen to form water. 
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A hydrogen-air fuel cell system 
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tures of. Figure 30 

Cell voltage, system efficiency, and net power density 
varying along net current density of a hydrogen-oxygen 
fuel cell [2] 

The electrodes of PEMFCs are made of porous 
carbon which allows the active gases to pass through. 
The electrode surfaces are coated with platinum as 
a catalyst which increases the rate of oxidation at 
the anode and the rate of reduction at the cathode. 
The platinum catalysts allow the chemical reaction to 
take place at a lower temperature. Alternatively to avoid 


Electric, Hybrid Electric and Fuel Cell Vehicles, Architec¬ 
tures of. Table 2 Operating data of various fuel cell 
systems [3] 


Cell system 

Operating 
temperature, °C 

Electrolyte 

Proton exchange fuel 
cells (PEMFCs) 

60-100 

Solid 

Alkaline fuel cells (AFCs) 

80-230 

Liquid 

Phosphoric acid fuel 
cells (PAFCs) 

60-200 

Liquid 

Molten carbonate fuel 
cells (MCFCs) 

600-1000 

Liquid 

Solid oxide fuel cells 
(SOFCs) 

1000-1200 

Solid 

Direct methanol fuel 
cells (DMFCs) 

100 

Solid 


the cost of expensive catalysts, some fuel cells are 
designed to work at elevated temperatures. 

The platinum catalyst used in PEMFCs is very 
expensive and extremely sensitive to poisoning by 
even small amounts of carbon monoxide. Thus a high 
purity of hydrogen gas is necessary. 

Electrolyte is a thin, fragile sheet of acidic, solid 
organic polymer about 50 pm thick which permits the 
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passage of hydrogen ions but is impermeable to 
electrons. Acidic compounds are fluids with free 
hydrogen ions and these are the fuel cell’s charge 
carriers [4]. 

The first PEMFCs were developed in the 1960s for 
the need of a US-manned space program. It is nowa¬ 
days the most investigated fuel cell technology for 
automotive applications. It operates at 60-100°C and 
can offer a power density of 0.35-0.6 W/cm 2 . PEMFCs 
offer some definite advantages for electric and hybrid 
vehicle applications that include low temperature oper¬ 
ation, high power density among all the available types 
of fuel cells, solid electrolyte that does not change, 
movement and vaporization from the cell, and most 
importantly, potentially high efficiency and being free 
of toxins and greenhouse gas emissions. The major 
problems are their requirements of expensive noble 
metal catalyst and membrane, and a need of high purity 
of hydrogen as its fuel. 

Alkaline Fuel Cells (AFCs) Alkaline fuel cells use 
aqueous electrolytes of potassium hydroxide (KOH) 
to conduct ions between electrodes. Hydrogen and 
oxygen are used as the fuel and oxidant. The ion carried 
by the alkaline electrolyte is a hydroxide ion (OH - ). 
The half reactions are: 

Anode : 2H 2 + 40H - —> 4H 2 0 + Ae~ 

Cathode : 0 2 + 2H 2 0 + 4e - —> 4HO - 

AFCs are capable of operating over a wide range of 
temperature (80-230°C) and pressure (2.2-45 atm). 
AFCs are capable of achieving high operating efficiency 
because of the fast kinetics permitted by the hydroxide 
electrolyte. The fast kinetics in AFCs allows using silver 
or nickel as catalysts instead of expensive platinum. 

The performance of AFCs can be further improved 
by circulating the electrolyte, which offers some advan¬ 
tages that include easy thermal management, more 
homogenous electrolyte concentration, easy replace¬ 
ment of electrolyte, and possible removal of the elec¬ 
trolyte when it is turned off. Of course, circulation of 
electrolyte needs additional equipments, such as circu¬ 
lation pump, heat exchanger, and evaporator. Further, 
it increases the risk of leakage (potassium hydroxide is 
highly corrosive and has a natural tendency to leak even 
through the tightest seals). 


AFCs were some of the earliest practical cells and 
were used in the Apollo space program, generating 
drinking water as well as electric power. Although 
they are inexpensive compared with PEM cells, operat¬ 
ing efficiencies of 60% are possible. Unfortunately they 
have a low power output and the catalyst is prone to 
poisoning from carbon dioxide in the atmosphere. 

Phosphoric Acid Fuel Cells (PAFCs) PAFCs use 
acidic electrolyte to conduct hydrogen ions and the 
anode and cathode reactions are the same as in 
PEMFCs. PAFCs run at high temperature of around 
200° C, delivering high power of a MegaWatt or more 
but with relatively low efficiency of around 35%. The 
poor conversion efficiency is the high heat generation 
in the fuel cell stack. Because of the high working 
temperature the efficiency losses can be mitigated by 
using the waste heat in combined heat and power 
(CHP) applications. 

The advantage of PAFCs is the cheap electrolyte and 
disadvantages are the need of expensive catalyst (plat¬ 
inum), corrosion by acidic electrolyte, CO poising, and 
low efficiency. 

Molten Carbonate Fuel Cells (MCFCs) MCFCs run 
at high temperatures of 650-1,000°C. They use molten 
carbonate salt to conduct ions, usually, lithium- 
potassium carbonate or lithium-sodium carbonate. 
Their unique chemistry needs carbon dioxide from 
the air as part of the process. Efficiencies achieved are 
45% or more and power outputs of over 1 MW are 
typical in grid supply applications. Because of their 
high working temperature, they can operate directly 
with hydrocarbon gases which are reformed within 
the cell and do not need a separate hydrogen supply. 
The high temperature also means that less expensive 
catalysts are needed, but the molten electrolyte 
imposes special requirements on containment and 
anticorrosion measures. 

Solid Oxide Fuel Cells (SOFCs) Solid oxide fuel cells 
also operate in the same or higher temperatures as the 
molten carbonate cells with the same fuel and catalyst 
advantages. The ceramic electrolyte which can run as 
hot as 800° C has the advantage that the electrolyte stays 
solid. They can deliver powers of several Megawatts but 
at a lower efficiency of around 35%. 
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Direct Methanol Fuel Cells (DMFCs) Direct meth¬ 
anol fuel cells are similar to PEM cells. Instead of pure 
hydrogen, they directly use liquid Methanol as fuel. It is 
sure that there are definite motivations for using 
DMFCs to vehicles. Methanol is a liquid fuel at room 
temperature that can be easily stored, distributed, and 
marketed through current infrastructures. Further, 
methanol is the simplest organic fuel that can be eco¬ 
nomically and efficiently produced on a large scale 
from fossil fuel, such as coal and natural gas. Further 
it can be also produced from biomass products. 

Anode and cathode electrode of DMFC also need 
platinum or platinum alloys as catalyst. The electrolyte 
can be a trifluoromethane sulfonia acid or proton 
exchange membrane (PEM). The chemical reactions 
in a DMFC are: 

Anode : CH 3 OH T H 2 O —> CO2 T 6 H^ T 6 c 
3 

Cathode : -0 2 + 6 H + + 6e —> 3H 2 0 

Overall: CH 3 0H + ^0 2 -► C0 2 + 2H 2 0 

Tike PEM fuel cells, DMFCs work at low tempera¬ 
tures in the range of about 50-120° C. They have 
a relatively low efficiency and power density. Output 
power using current technology is limited to about 
1.5 kW which is enough to power most consumer 
goods but not sufficient for automotive applications. 


Nevertheless the ability to use liquid fuel coupled with 
the elimination of the reformer make these fuel cells 
very attractive. 

Fuel Cell Vehicles 

Fuel cell-powered vehicles potentially have the advan¬ 
tages of no toxic and greenhouse emissions and high 
operation efficiency. Compared to internal combustion 
engine, fuel cells have a prominent advantage of high 
operation efficiency at low power output which is coin¬ 
cident to a vehicle’s normal driving situation. This 
means that fuel cells in a fuel cell-powered vehicle 
will naturally operate in their high-efficiency region 
in normal driving conditions. 

Fuel cell-powered vehicles can be either fuel cell 
alone or hybridized with batteries and/or ultra¬ 
capacitors. Fuel-cell-alone-powered vehicles have 
a very simple configuration, similar to pure electric 
vehicles by replacing the battery pack with a fuel cell 
system as shown in Fig. 31. 

A fuel-cell-alone-powered vehicle is a single power 
source vehicle. The fuel cell power follows the load 
profile. The power rating of the fuel cell system is 
designed to meet the vehicle performance requirement. 
A fuel cell system management is required to manage 
the fuel cell operation, such as air supply, fuel supply, 
and cooling of the fuel cells for operating in their 
safe range. 
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Configuration of fuel-cell-alone-powered vehicle drive train 
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Fuel cell is a unidirectional energy converter. It 
cannot operate in opposite directions to absorb energy 
from an electric energy source. Thus, a fuel-cell-alone- 
powered vehicle is not capable of recovering braking 
energy. 

Fuel Cell-Powered Hybrid Vehicles 

Fuel cell system, at today’s technology, has low power 
density. For satisfying vehicle performance, a bulky and 
heavy fuel cell system is needed, which in turn, harms 
the vehicle performance and causes high energy losses. 
In addition, lack of regenerative braking capability is 
a major shortcoming of this drive train. 

Hybridization of fuel cell system with a secondary 
power source, such as a battery pack and/or 
ultracapacitor pack is an effective technology to over¬ 
come these shortcomings. A fuel cell hybrid vehicle 
drive train is shown in Fig. 32. 

Fuel cell hybrid vehicle drive train consists mainly 
of a fuel cell system as primary power source, a 
secondary power source that may be a battery pack 


and/or ultracapacitor pack. The powers of the fuel 
cells and secondary power source are coupled together 
at an electronic interface which is used to control the 
power flows. A vehicle controller is used to control the 
drive train based on the driver’s command and system’s 
feedback signals. 

There are several operation modes that can be 
selectively used which are: 

1. Fuel cell alone propelling: in which the electronic 
interface isolates the secondary power source from 
the power line and fuel cell pack alone supplies 
power to the electric motor. 

2. Secondary power source alone propelling: in which 
the fuel cell pack is isolated from the power line 
and the secondary power source alone supplies 
power to the traction motor. 

3. Hybrid propelling: in which both the fuel cell sys¬ 
tem and the secondary power source provide pow¬ 
ers to traction motor simultaneously. 

4. Fuel cell power split mode: in which fuel cell power 
is split into two parts by the electronic interface. 


Accelerator Brake 



Electric, Hybrid Electric and Fuel Cell Vehicles, Architectures of. Figure 32 

Configuration of a fuel cell-powered hybrid drive train 









































Electric, Hybrid, and Fuel Cell Vehicles, Introduction 



One part is used to power the traction motor and 
the other charges the secondary power source. 

5. Regenerative braking: in which the traction motor 
functions as a generator to recover the braking 
energy and stores it in the secondary power source. 

By electively using the operation modes, the fuel 
cells can be operated in their most efficient region as 
shown in Fig. 33. 

The electronic interface plays a critical role in the 
drive train control. It should be designed with three 
ports and the ports to the secondary power source and 
electric motor are bidirectional. The configuration and 
operation principle are very similar as that used in 
engine/generator-based electric coupling (series) 
hybrid drive train. 

Future Directions 

The electric vehicle has a relatively simple power train 
structure. The most important components are the 
motor drive and the battery system. An advanced 
motor drive with high power density, high efficiency, 
and long extended speed range is the key technology. 
The success of an electric vehicle is almost determined 
by the performance of the battery system. The basic 
requirements for electric vehicle batteries are the high 
energy capacity for extended range, high power for 
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Optimal operation region of a fuel cell system 


vehicle performance, high safety, and low cost. At pre¬ 
sent, lithium-ion batteries are considered to be the 
most promising candidates. For hybrid vehicles, 
advanced primary power sources are very important 
that may include advantage engine technologies and 
alternative fuel engines. Advanced control strategies 
and real-time control algorithms are also very impor¬ 
tant. The key technologies for fuel cell vehicles include 
enhancement of fuel cell power density, reduction of 
cost, and, more importantly, hydrogen production and 
onboard storage. 
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The success of internal combustion engine vehicles 
over the past century is one of the greatest technolog¬ 
ical achievements of mankind with a profound impact 


3391 


E 












3392 


E 


Electric, Hybrid, and Fuel Cell Vehicles, Introduction 


on virtually every aspect of human life on the plant. 
However, the very widespread success of this technol¬ 
ogy has produced associated problems which can be the 
seeds of its own long-term demise. Solution of these 
problems will guarantee the long-term survival of per¬ 
sonal and mass transit vehicles which are so essential to 
the economy and welfare of human beings. The proper 
combination of these solutions can bring about what is 
termed sustainable vehicles. 

This section of the encyclopedia describes the 
important sustainable vehicle technical topics and the 
present state of the art on each topic. The following is 
a brief review of the entries in this section. 

► Sustainable Transportation describes the techni¬ 
cal and environmental problems that have to be 
addressed to achieve sustainable vehicles for the future. 
Issues of fuel, environment, and vehicle technologies 
are introduced. 

► Electric, Hybrid Electric and Fuel cell Vehicles, 
Architectures of describes the basic configuration of the 
drivetrain of electric vehicles and hybrid electric vehi¬ 
cles. The various types of hybrid electric vehicles’ drive- 
trains are presented. The basic operation of each of 
these systems and their components are described. 

► Vehicle Dynamics and Performance describes 
vehicle drivability, fuel economy, braking performance, 
handling characteristics, noise, vibration, and harsh¬ 
ness. Vehicle performance discussed in this entry is 
restricted to propelling and braking performance in 
terms of vehicle speed, acceleration, road grade 
climbing, braking deceleration, and braking force dis¬ 
tribution on the front and rear wheels. 

► Vehicle Energy Storage: Batteries gives an over¬ 
view of batteries for vehicle applications and discusses 
the research and development roadmap of next- 
generation batteries for vehicle applications. It 
describes how the battery is used in EV and HEV and 
what the vehicle requirements on it are. It also presents 
the basic state of the art on the available batteries today. 

► Battery Technologies describes the internal work¬ 
ing of the electrochemical battery, the relationship 
between the internal resistance of the battery cell, max¬ 
imum power output, energy efficiency in charging and 
discharging, and the issue of cell heating. A review of 
the characteristics of lead acid, nickel metal hydride, 
lithium-ion batteries, and super capacitors are then 
presented. 


► Energy storage: Ultracapacitor describes the new 
technology of ultracapacitors for high-power energy 
storage and delivery. This is another approach to 
providing energy storage for hybrid vehicles. 
Ultracapacitors can be used either alone or in combi¬ 
nation with batteries. In this approach, the batteries 
can be optimized for energy density and cycle life, and 
ultracapacitors can provide the power both for accel¬ 
eration and regenerative braking. 

► Hybrid Energy Storage Systems for Vehicle 
Applications describes the possibility of composing 
a high-energy, high-power hybrid energy storage sys¬ 
tem for vehicle applications, based on the analysis of 
inherent characteristics of different energy storage 
methods. The basic components in this system are 
chemical batteries, ultracapacitors, and flywheels 
(mechanical batteries). The objective is to fully utilize 
the advantages and suppress the disadvantages of each 
component to compose a compact energy storage sys¬ 
tem with high energy capacity and high power 
capability. 

► Batteries, Battery Management, and Battery 
Charging Technology describes issues related battery 
charging and battery management. Battery manage¬ 
ment is one of the important engineering challenges 
of using batteries in vehicles. It starts with an overview 
of different types of rechargeable battery chemistries. 
Then, it discusses circuits for charging and managing 
vehicle batteries. 

► Vehicle Traction Motors describes the role of the 
electric motor in EV and HEV drivetrains. The require¬ 
ments that the vehicle put on the electric motor are 
explained. This is used as the basis to present the 
various traction motor drive technologies. 

► AC Machines: Permanent Magnet Synchronous 
and Induction Machines describes the most common 
electric machines, motors, and generators that are used 
in EV and HEV drivetrains. It presents AC machines 
including induction machine (IM) and permanent 
magnet synchronous machine (PMSM). It is shown 
that both the IM and PMSM take advantage of the 
rotating magnetic field that is produced by their 
three-phase stator windings. 

► Switched Reluctance Motor Drives for Propul¬ 
sion and Regenerative Braking in EV and HEV 
describes a new basis for the selection of vehicle trac¬ 
tion motor technology. The concept of extended speed 
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range at constant power and its benefit in vehicle trac¬ 
tion application is introduced. On this basis, the mod¬ 
ern switched reluctance motor drive technology is 
shown to be very attractive for future EV and HEV 
applications. In addition to having a wide speed 
range, these motors have a rugged and low-cost struc¬ 
ture, reliable drive power converter topology, and high 
efficiency over a wide range of speed. 

► Regenerative Braking describes the important 
technology of regenerative braking in EV and HEV 
drivetrains. Regenerative braking is responsible for 
most of the battery energy and fuel savings in these 
vehicles. This entry presents the principles of regener¬ 
ative braking operation, design, and its hardware. 

► Hybrid Electric and Hydraulic Technology Appli¬ 
cations in Off-Road Vehicles describes a special call of 
HEV that are more suitable for off-road and heavy- 
duty applications. These include military vehicles, 
construction and mining vehicles, and similar indus¬ 
trial and special-purpose heavy uses. The special use 
of hydraulics in the drive train as part of the hybrid¬ 
ization of the vehicle is of particular interest in this 
entry. 

► Plug-in Hybrid Electric Vehicles describes 
a special class HEV that is called plug-in HEV 
(PHEV). It is shown that the PHEV has certain advan¬ 
tages over pure EV and HEV, such as (1) displacement 
of significant amount petroleum fuel by electric energy 
that comes from utility grid, (2) longer range than 
a pure EV, (3) no fuel consumption and emission 
during pure EV mode, and (4) lower overall fuel con¬ 
sumption and emission. 

► PHEVs and BEVs in Coupled Power and Trans¬ 
portation Systems describes the role of utility electric 
power systems in the EV and HEV transportation sys¬ 
tem. The impact of plug-in HEV and battery-operated 
EV (PHEV/BEV) transportation systems on the utility 
power systems is discussed. The mutual benefits of 
charging the EV/HEV from the utility grid and some 
of their draw backs are addressed. 

► Internal Combustion Engines, Alternative Fuels 
for describes the technical implications of alternative 
fuels on vehicle engines, their performance and emis¬ 
sions characteristics. The characteristics of the most 
dominant alternative fuels are defined and compared 
to conventional gasoline and diesel fuel. Of particular 
interests are the renewable alternative fuels. 


► Vehicle Biofuels describes the nature and origins 
of liquid fuels for vehicle engines that are derived from 
biomass. Liquids biofuels are the most valuable because 
they are easily stored, have a high energy density, and 
are readily metered into engines. It is shown that 
biofuels are infinitely renewable provided appropriate 
agricultural practices are followed. 

► Sustainable Vehicle Fuels, Well-to-Wheel Analy¬ 
sis describes the important stages, results, and key 
issues associated with the well-to-wheel stages of 
biofuels, hydrogen, and electricity as alternative trans¬ 
portation fuels. The importance of comparing alterna¬ 
tive vehicle engine fuels with the conventional fossil 
fuels is presented from the fundamental point of view. 

► Fuel Cell Powered HEV Design and Control 
describes different types of fuel cells and their electro¬ 
lytes, sources of fuels for fuel cells, those applicable to 
the transportation sector, and the hybrid vehicle archi¬ 
tecture. The ICE hybrid is also presented as a base 
model, since its operation is well understood by the 
general public. 


Electrical Transmission Systems and 
Smart Grids, Introduction 
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Transmission systems represent the backbone of the 
electric energy. They support transport of electric 
energy from large producers (power plants) to the 
load centers (residential areas, manufacturing facilities, 
business centers or a combination thereof). Those 
networks are probably among the largest human- 
made engineering systems - the transmission network 
in the United States covers over 300,000 km of lines and 
is served by 500 companies (electric utilities). 

In contemporary power systems, the notion of net 
energy producers and net users is increasingly blurred 
as most economic generator capacities are of smaller 
size and can be (and often are) installed near users’ 
locations - example of small photovoltaic generators or 
wind farms which can be installed on the roofs of 
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residential homes or in their vicinity illustrates that 
alternative, popular for many reasons (little or no 
maintenance needed, decreasing cost of electricity 
generated by such small generators, many of which 
are based on various renewable energy sources, 
reduction of congestion which is the result of carrying 
large amounts of power across vast distances, reduction 
of transmission losses, regulatory and policy-driven 
economic incentives for small owners of generators, 
reduction of carbon footprint, and enhancement of 
sustainability of such solutions, etc.) The need for 
electric energy systems is not only to run the 
equipment in manufacturing facilities or appliances 
in residential homes, but also to interact in various 
ways with other supporting infrastructures (water, 
gas, transportation, information, etc.) Those 
infrastructures are interdependent on one another 
and their efficient and reliable operation requires a 
thorough understanding of those interdependencies 
and adequate planning and operational support 
(see ► Energy and Water Interdependence, and Their 
Implications for Urban Areas). 

As electric energy travels across waveguides 
(conductors), it incurs losses due to dissipation of 
current across the resistances of imperfect conductors. 
High-voltage overhead conductors do not need insula¬ 
tion. The conductor material is nearly always made of 
an aluminum alloy, which typically forms several 
strands and often is reinforced with steel strands 
for mechanical strength. Copper was more popular 
conductor choice in the past, but substantially lower 
weight and cost of aluminum and its only marginally 
inferior electrical performance have been the reasons 
for the current dominance of aluminum-based 
conductors in the transmission networks. As large 
amounts of power being transferred across the lines 
may incur considerable losses in transmission, 
typically such bulk transfers are performed at higher 
voltages, which require smaller currents (losses in the 
conductors are proportional to the square of the 
current, which means that operating the transmission 
line at double the voltage incurs only about 25% of 
the losses produced while operating at lower voltage). 
It is the need for changing the operating voltage levels 
as a function of the energy throughput that has forced 
design transition from the original DC transmission 
(introduced by Thomas Edison) into AC, originally 


deployed by Nikola Tesla and George Westinghouse in 
the 1880s to transfer power from the power plant at 
Niagara Falls, and nowadays used almost everywhere. 
Today, the highest operating AC transmission voltages 
can be up to 500 kV and even 800 kV. Ironically, 
when need for high power transfers calls for operation 
at voltages higher than 800 kV, it is done via DC 
transmission and with use of the large HVDC converter 
stations. The reason is inductances of the large 
overhead transmission lines, which at the highest 
voltages ultimately choke the efficient transmission of 
electric energy. 

Transmission voltages are usually considered to be 
110 kV and above. Lower voltages such as 66 kV and 
33 kV are commonly called sub-transmission voltages. 
Voltages less than 33 kV are mostly used for distribu¬ 
tion. Design of distribution networks is driven by their 
traditional role as infrastructure for disseminating 
bulk electric energy to a large number of customers. 
Such fragmentation of delivery requires distribution 
networks to operate somewhat analogous to capillaries 
in a cardiovascular system, at smaller capacities 
(and lower voltages) and covering large areas of 
sparsely populated customers (in rural areas) or 
densely populated smaller areas (in modern urban 
settings where increasingly large part of the world 
population now resides). Traditional design is evolving 
of radial distribution network, consisting of feeders 
supplied from the substations which interconnect 
them to the bulk power transmission networks. Such 
simple configurations were enabled by unidirectional 
flows of energy and simple distribution hardware 
which was supporting it. Transformation of 
distribution system into the site of both consumption 
and (distributed) generation of electric energy, as well 
as increasing need for enhanced interconnectivity at the 
distribution level, is imposing need for different 
designs. Part of the contemporary distribution 
networks are likely to experience a transition to 
microgrids (more meshed and better controlled 
distribution networks which can be used flexibly as 
reconfigurable autonomous, or grid-connected 
infrastructure for distribution of electric energy). 

The structure and function of electric substations is 
also changing rapidly. Substations represent a vital part 
of the power grid infrastructure with many important 
functions (ability to reconfigure the system topology, 
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isolate equipment for maintenance and repair, monitor 
and communicate various system parameters and 
electrical variables to the control center or elsewhere, 
actuate control actions or protective relaying decisions, 
etc.) Substation automation represents one of the 
fastest evolving parts of the modern (smart) grids and 
will continue its evolution to keep up with the demands 
for more flexible and effective monitoring, control, 
and protection of power systems, both on the 
transmission and distribution side (see ► Distribution 
Systems, Substations, and Integration of Distributed 
Generation). 

A growing part of the distribution networks, 
especially in developed countries, is being served by 
underground cables instead of overhead lines. There 
are many reasons in favor of such solutions (reliability, 
esthetics of the area where underground lines are 
installed, less vulnerability to the elements, etc.) and 
some against (considerably higher cost, faster pace of 
aging, especially due to moisture and impulse electrical 
stress, such as a consequence of lightning strikes in 
vicinity of the installations, and lack of effective 
diagnostic procedures to assess the operational status 
and effective remaining lifetime of the cables, more 
expensive repairs, etc.) Nevertheless, underground 
distribution (and in some places, underground 
transmission) represent a growing portion of the 
energy systems’ assets, both in terms of importance 
and cost, and considerable care needs to be paid to 
their management and upkeep (see ► Underground 
Cable Systems). 

Distributed generation (DG) can be defined 
as small-scale, dispersed, decentralized, and on-site 
electric energy systems. Currently, capacities of DGs 
vary typically in the range of several kW to hundreds 
of MW. As more DG penetrates the electric energy 
systems, more accurate and efficient system analysis 
algorithms are needed in order to analyze the impact 
of the DG system on various types of microgrids 
and distribution networks. Since DG can change the 
operation of the distribution system and interfere with 
its protection and control, electric power utilities are 
not motivated to interconnect customer-owned small 
generators to their distribution networks. Utilities 
tend to put nonutility generation under the extensive 
technical analysis. Conversely, the regulating authori¬ 
ties tend to act in favor of DG owners and support 


that the interconnection be as easy and transparent as 
possible. Nevertheless, the favorable economic features 
of small-scale distributed generators, especially 
those using the renewable energy resources as input, 
will make them increasingly popular and much more 
widespread than they are at the moment of creation of 
this text. Even now, many countries (Ireland, Spain, 
Denmark, etc.) possess considerable renewable 
resources as part of their generation portfolios. At the 
time when wind generation of electricity is the fastest 
growing new generation technology (in terms of new 
installed capacity) and when the energy produced 
by renewable resources (including hydro plants) is 
already larger than energy obtained from the nuclear 
power plants (United States in 2011), engineering 
challenges of planning, operating, controlling, and 
protecting the new power grids are substantial 
and require major transformative changes (see 
► Renewable Generation, Integration of). This may 
become even more important as some countries elect 
to gradually abandon conventional nuclear generation 
and transition to other, more sustainable, generation 
resources. 

One of the fundamental constraints in the 
transmission and distribution of electricity is that, for 
the most part, electrical energy cannot be stored, and 
therefore must be generated whenever needed. Few 
exceptions to that limitation have been found 
and exploited. The biggest problem is that currently 
available storage options cannot effectively be used at 
utility-scale capacities (pumped hydro plants are the 
best known among them). In the interim, a large num¬ 
ber of smaller capacity storage technologies have been 
developed and advanced (superconductive magnetic 
storage, flywheel, battery storage, etc.), mostly to find 
applications as uninterruptible power supplies for crit¬ 
ically important (but relatively small) loads and rarely 
exceed the capacities needed to achieve similar effects in 
the bulk power networks. When a storage technology is 
developed and commercialized to operate at such large 
capacity levels economically, it will trigger a major rev¬ 
olution in power grid planning and operation. 

A very advanced infrastructure of monitoring, 
control, and protection (see ► Wide Area Monitoring, 
Protection and Control) is required to enable real-time 
balancing between electric generation and energy 
demand due to the lack of effective storage options. 
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If generation and consumption of electric energy are 
not in balance or if the complex infrastructure of volt¬ 
age control across the system is challenged by the heavy 
loading conditions and/or the aftermath of an 
unforeseen major disruption in system operation 
(such as the outage of a major piece of equipment), 
generation plants and transmission lines can shut 
down which may lead to a major blackout. Sometimes 
such blackouts develop spontaneously in a cascading 
chain of equipment outages caused by spreading of the 
overloads through the equipment system-wide as 
a consequence of an initial disturbance, which may be 
relatively minor in its initial effects. These series of 
events may occur in unanticipated sequence, and 
can be hard to foresee even with large computers. An 
unexpected contingency may cause an amplifying effect 
of larger sequential outages and progressively stress the 
system to the point when the disturbance can no longer 
be contained. In the domain of large power grids, 
“flapping of the butterfly wings” in certain places can 
literally produce a “storm” in others (see ► Transmis¬ 
sion Blackouts: Risk, Causes, and Mitigation). 

To reduce the risk of such failures, electric 
transmission networks are highly meshed and 
interconnected into regional, national, or continental 
wide networks, providing multiple redundant alternate 
paths for energy to flow when needed. Considerable 
effort is expended by electric utility companies to 
ensure that sufficient spare capacity and redundant 
pathways for energy transfer are always available 
to mitigate the consequences of even large multiple 
disruptions of the network. In order to maintain 
sufficient security margins under the threat of multiple 
unpredictable contingencies, well-coordinated plans 
for preemptive (slower, based on extensive 
optimization algorithms applied to coordinate genera¬ 
tion and control system wide) and preventive (much 
faster, emergency control and protection) actions need 
to be developed so that the system does not descend 
into a blackout and necessitate lengthy and costly 
restoration procedures (see ► Smart Grids, Distributed 
Control for). Particular focus should be on 
determining the ability of system to survive extreme 
contingencies, triggered by very unlikely chains of 
events, but capable of propagating into costly wide¬ 
spread outages with long-term consequences for both 
consumers and the power companies. 


All of the above functional characteristics and 
solutions describe what is commonly referred to as 
smart grid. Gradual application of emerging technolo¬ 
gies for advanced power grid management and control/ 
protection represents an effective transition to smart 
grid, which from the perspective of different authors 
and researchers may assume different characteristics, 
but in general shares the following functional 
properties: 

• Ability to resiliently recover to the extent possible 
from the effects of damaging or disruptive distur¬ 
bances (self-healing) 

• Providing opportunities for consumer participation 
in energy management and demand response (often 
via advanced metering options which may provide 
additional support and information, both for the 
utility and the customer, during normal operation) 

• Ability to respond to, cope with, and resiliently 
enhance itself against physical and cyber attacks 

• Providing power quality for modern equipment 
anticipated to be needed in the future 

• Accommodating all generation and storage options 

• Enabling new products, services, and markets 

• Optimizing assets and operating efficiently 

The transition to smart grid may in some cases be 
spontaneously driven by obvious benefits and cost- 
effectiveness, in others may be supported by regulatory 
and policy actions (see ► Sustainable Smart Grids, 
Emergence of a Policy Framework). 
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Glossary 

COE Cost of Energy: A method to calculate the elec¬ 
tricity costs taking into account investment and 
operation and maintenance costs. 

Control and safety systems Systems to keep small 
wind turbines within their operational range and 
to bring them back into a normal operational 
mode. 

Darrieus rotors A VAWT with curved rotor blades 
using a lift principle for its operation. 

DAWT Diffuser-Augmented Wind Turbine or ducted 
wind turbine. 

Drag Force in the direction of the incoming flow on 
a (streamlined) body. 

HAWT Horizontal Axis Wind Turbine. 

H-type VAWT A VAWT with rotor blades rotating 
along a cylindrical surface. 

Lift Force perpendicular to the incoming flow on a 
(streamlined) body. 

Savonius rotors “Crosswind” wind turbines with the 
main shaft perpendicular to the wind. 

VAWT Vertical Axis Wind Turbine. 

Definition of the Subject 

Small wind systems can be seen in autonomous oper¬ 
ation for pumping water, desalinization, and battery 
charging. A growing number of small wind turbines is 
connected to the grid, most of them behind the meter. 
This entry will focus on electricity-generating small 
wind turbines. Since the electricity production with 
wind turbines on buildings is a fairly new development 
with an interesting potential it will also be addressed. 
Safe and reliable operation of small wind turbines is 
technically challenging since they usually operate in 
more turbulent flows than large-scale wind turbines. 
A short description of the working principles of various 
types of small wind turbines is given; their safety and 
control systems are discussed as well as noise and 
vibrations. Furthermore, a treatment of the investment 


costs and the cost of energy is provided. Finally the 
implications for locating wind turbines on top of 
buildings are treated. 

Nontechnical barriers such as legislatory and regu¬ 
latory issues are not addressed, although they are very 
important for successful implementation of small wind 
turbines. 

Introduction 

There is a growing interest in the use of small wind 
turbines. The market is already significant and it is 
expected to grow at a similar rate as large-scale wind 
power. Small wind turbines are not only developed for 
remote applications in rural areas but also, to an increas¬ 
ing extent, for application in urban areas and on build¬ 
ings. This entry will address small wind turbines for 
electricity generation. Other important applications, 
such as wind pumps for irrigation, drainage, and the 
provision for drinking water, are not addressed. 

For further reading about wind pumping technology 
the reader is referred to Lysen [24] Kentfield [25] and de 
Jongh and Smulders [23]. 

In a strategy document issued by the American 
Wind Energy Association, a small wind turbine com¬ 
mittee chaired by Bergey [2], a projected total of 
75,000 MW wind power capacity by 2020 is provided 
by small- to medium-scale systems. With an average of 
2 kW per turbine this would mean around 30-40 
million units. Of this amount some 15,000 MW is 
foreseen at locations that are (also) suitable for rooftop 
mounting of wind turbines. A potential market of 
similar extent is present in the EU (Van Bussel 2005). 
Small wind turbines are also a very interesting option 
for small-scale energy supply at remote sites where no 
grid electricity is available and a grid extension would 
be too costly (e.g., for remote villages or homes). As 
such, they may have a great potential in developing 
countries. The most striking example is probably their 
dissemination among the nomads in Inner Mongolia 
(China), where over 100,000 small portable wind tur¬ 
bines (capacity about 100 W) are in use for “domestic” 
power supply. Such small off-the-grid wind turbines 
are often referred to as wind chargers [22]. 

Small wind turbines are not only developed for 
remote applications in rural areas but also, to an 
increasing extent, for application in urban areas and 
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on buildings. Yet the technology and the costs of 
small-scale wind deployment are not at the level of 
large-scale wind deployment. There is not a unique 
definition of what is regarded as a small wind turbine 
or a small wind system. The AWEA uses an upper limit 
of 100 kW and a maximum rotor diameter of 20 m [2] 
while the British Wind Energy Association [26] takes 
15 kW and an annual production of 50,000 kWh as the 
upper limit without referring to any rotor size. Others 
[28] take a maximum swept area of 50 m 2 as the 
criterion. In this entry, the viewpoint is taken that 
a small wind turbine system is a system that is able to 
provide (part of the) electricity consumption for a local 
(private) customer. 

Wind turbine deployment in the built environment 
is inherently more complicated than in rural, open 
spaces. In successfully exploiting wind energy in the 
built environment, three major issues can be identified: 
wind resource assessment and wind characterization 
around buildings, structural integration of wind tur¬ 
bines with buildings, and special urban wind turbine 
design requirements. 

Safe and reliable deployment of wind turbines in the 
highly turbulent built environment is specialized and 


technically challenging. In almost all situations, the use 
of existing (small) wind turbines will be problematic due 
to the fact that they are not adapted to the complex wind 
environment and to additional building-related require¬ 
ments. Some of these requirements are the severe noise 
restrictions and the ability to match the structural and 
esthetical integrity of buildings. This means that new 
ways need to be found in the design of new wind 
turbines that do fit the requirements for the built 
environment. 

Nontechnical barriers such as economics, bureau¬ 
cratic, and regulatory issues will not be addressed, 
although they are very important for successful imple¬ 
mentation of wind turbines in the built environment. 

The Feasibility of Small Wind Turbines 

This section shows a comparison between large 
and small wind turbines. The comparison is performed 
in order to show if there is space for small wind 
turbines in the built environment from an economical 
point of view. 

Consider a large wind turbine with a rotor diameter D 
of 70 m and a swept rotor area A s of 3,848 m 2 (Fig. 1). 
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Comparison of large and small wind turbines 
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The height from the ground to the nacelle and main shaft 
hfshaft is 85 m. The wind turbine yield is assumed to be 
800-1,100 kWh/m 2 /year, evidently depended on its 
location. This means that around 3,000-4,500 MWh/ 
year can be produced with a wind turbine of this size [8] . 

In the West European situation, this means that a 
few (3-10) m 2 of the swept surface suffices to meet the 
average electricity demand of one family. 

Now take a small wind turbine with a diameter of 
4 m at a 15-m tall mast. The swept area is then 
12.57 m 2 . With an average yield of 300 kWh/m 2 /year 
(feasible in a very good wind climate: an average of at 
least 6 m/s at hub height) this is sufficient for one 
family (3,770 kWh/year). It thus takes 1,000 of these 
relatively small wind turbines to provide the same 
amount of electricity. 

In normal utility grid operation it will be very 
difficult to be competitive in terms of the costs per 
kilowatt hour. eEither at remote locations where con¬ 
ventional electricity is very expensive and building 
large wind turbines is not possible or in situations 
where it is allowed to generate electricity behind 
the meter, since consumer price of electricity may easily 
be three times the production costs, small wind turbine 
may become a feasible solution. 

The consumer’s price of electricity was 0.22€/kWh 
in 2008 in the Netherlands. The Dutch consumer’s 
price is roughly built up out of generation of electricity 
(30%), taxes (40% inch carbon tax), and the distribu¬ 
tion and administration costs (30%). This consumer’s 
price may give an indication for the level of the cost of 
electricity which might make small wind turbines 
economically viable. 

Small Wind Turbine Technology 

Most of the modern small wind turbines typically have 
an upwind propeller type rotor, often directly coupled 
(without a gearbox) to an electrical generator (Fig. 2a). 
Such wind turbines are usually referred to as horizontal 
axis wind turbines (HAWTs). An increasing amount of 
small wind turbines pop up on the market with uncon¬ 
ventional rotor topologies, such as Savonius rotors 
(Fig. 2b), Darrieus, and H-type vertical axis rotors 
(VAWTs) (Fig. 2c and d), as well as horizontal axis 
wind turbine rotors equipped with a duct (DAWTs) 
(Fig. 4a). 


Vertical axis wind turbine can be further distin¬ 
guished through their principle of operation. Some 
are driven by drag differences between the forward 
moving and the retracting blade; others use the lift 
principle, similar to that of “propeller”-type HAWTs. 

Small wind turbines have mechanisms to limit the 
rotor speed and the forces on the rotor at high wind 
speeds. The rotor speed is usually allowed to vary with 
the wind speed. In the past, small wind turbines were 
often equipped with a direct current (DC) generator. 
Nowadays almost all machines are equipped with alter¬ 
nating current (AC) generators, most of them using 
Permanent Magnets (PM generators) which supply an 
electric output with variable voltage and variable 
frequency. An inverter is used to convert this variable 
speed AC into either fixed frequency AC to comply 
with the local grid convention, or to DC when the 
wind turbine operates in an off-the-grid mode using 
batteries for storage. 

Since the principle of operation is an important 
issue with respect to the aerodynamic efficiency of the 
machine it is important to address the aerodynamic 
theory of these machines shortly. 

Performance of Drag-Driven Wind Turbines 

The conversion of wind energy into rotational energy 
of drag-driven wind turbines is based on the drag 
difference of rotating bluff bodies (as in Fig. 3). 

An anemometer like drag-driven wind turbine is 
depicted. It consists of bluff bodies with different 
pressure drag induced by their shape. This difference 
in drag causes the turning moment. 

The “tip speed ratio” X is defined as X = ^ where co 
is the rotational speed, R the radius of the rotor, and U 
the incoming wind speed. 

The aerodynamic efficiency defined as 
Cp = i A where A is the total area swept by the 
wind turbine rotor projected on a plane perpendicular 
to the wind direction and P is the (aerodynamic) power 
extracted from the wind. The nondimensional coeffi¬ 
cient C P is called the power coefficient and is a measure 
of the (aerodynamic) efficiency of the machine. 
Where the theoretical maximum power coefficient of 
lift-driven wind turbines is Cp = ^, the maximum 
attainable C P of a drag-driven wind turbine can be 
approximated by Cp ~ ^ Q>, where C Dr is the drag 
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(a) Small HAWT (b) drag-driven VAWT (c) small lift-driven VAWT (d) H-type lift-driven VAWT 
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Left : AES drag-driven VAWT, right : aerodynamic working principle 
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coefficient of the retreating (moving with the wind) 
rotor blade. Here it is assumed that the drag coefficient 
of the advancing blade is zero: C Da = 0, where this is 
only the case when the advancing blade is completely 
sheltered from the wind. 

Drag coefficients of bluff bodies have maximum 
values up to 2; therefore the optimal aerodynamic 
efficiency of a drag-driven wind turbine will be up to 
^ ~ 0.3. In practice, drag-driven wind turbines usually 
have power coefficients below 0.1. This is in contrast to 
lift-driven machines which are able to achieve signifi¬ 
cant larger power conversion efficiencies. Large-scale 
HAWTs do have C P values ranging as high as 0.5 where 
small-scale wind turbines usually level off at values 
around C P = 0.35. 

Performance of Ducted Wind Turbines 

The performance of wind turbines equipped with 
a duct maybe larger than “bare” wind turbines. Figure 4 
shows an example of a small DAWT. 

As can be seen from the schematic on the right- 
hand side of Fig. 4, the duct is able to induce a larger 
mass flow through the rotor: the yellow and red colored 
areas indicate areas where increased velocities are 
found. This increased mass flow gives rise to larger 
performances of a propeller-type rotor located at that 
position. The performance of well-designed ducted 
wind turbines compares well with the performance of 
a “bare” HAWT when referencing to the exit diameter 
of the DAWT. In other words, if the exit diameter of 
a DAWT is the same as the rotor diameter of a HAWT, 
they are able to produce similar power at the same wind 


speed [6]. The duct of a DAWT acts as a tail vane and 
keeps the rotor oriented toward the wind. The possible 
largest advantages of a DAWT are the possibility to 
incorporate noise damping material into the duct and 
the fact that the outer area of the duct can be used for 
commercial and/or styling purposes. This, especially, 
holds for DAWTs operating on top of company 
buildings. 

Performance of Lift-Driven VAWTs 

The aerodynamic efficiency of a wind turbine depends 
upon other things on the way the wind energy is 
converted into mechanical power. 

A high aerodynamic efficiency can be achieved 
when the working principle is based upon lift forces. 
Such wind turbines make use of airfoil-shaped blades 
in order to generate their driving force. 

Lift-based VAWTs usually run fast and have slender 
blades (see Fig. 5), just like modern large-scale 
HAWTs. This is in contrast to drag-driven VAWTs, 
which achieve a significantly lower aerodynamic effi¬ 
ciency and use a large amount of rotor material. The 
Darrieus wind turbine (DWT) is the most widely 
known example of a lift-driven VAWT. Its working 
principle is depicted in Fig. 6. The energy extraction 
takes place during the cross-flow phases of the blades, 
that is, when the blades move perpendicular to the 
oncoming wind. With respect to a modern HAWT the 
Darrieus wind turbine has a somewhat lower efficiency 
(about 10%) in an ideal undisturbed low-turbulence 
wind environment. The main reason for this is that the 
rotor blades do not contribute to generating torque 
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(a) small Donqi® DAWT wind turbine (b) aerodynamic operation principle of a duct around a DAWT 
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Figure 5 

Globuan, a modern Darrieus wind turbine located on a flat 
roof 


This also gives rise to a fluctuating torque. Novel 
small VAWTs have mitigated these torque fluctuations 
by introducing a blade sweep. In Fig. 17, examples are 
shown of VAWTs with blade sweep. A second, large 
advantage of blade sweep is the significant reduction 
of noise. Normally high tip speed ratio (slender blade) 
Darrieus-type VAWT machines are very noisy, where 
the noise generated resembles the noise of an ascending 
helicopter. Another disadvantage of a lift-driven 
VAWT is its incapability to deliver a starting torque. 
This means that auxiliary systems are needed to get the 
machine running. Sometimes a small Savonius rotor is 
used, as in Fig. 5, but that has the disadvantage that 
the aerodynamic performance of the total system is 
influenced negatively in the normal operation mode. 
For this reason a start-up procedure is often 
implemented in which the generator of the wind 
turbine is temporarily used as a motor, to acquire a 
sufficient rotational speed. 

Control and Safety Systems 
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Figure 6 

Aerodynamic principle of operation of the Darrieus rotor 

when they are at the retreating and at the advancing 
phases of the revolution. On the contrary, then they 
generate drag only which has a detrimental effect on 
the performance. 


Each wind turbine needs a safety system to take action 
when the machine is getting out of its normal mode of 
operation. Also control systems are usually incorpo¬ 
rated in the design. Control systems are used for 
maximizing energy production while keeping the forces 
on the system within limits. Control and safety systems 
can be implemented using the same wind turbine com¬ 
ponents, but in terms of hierarchy the safety system 
should always be able to overrule the control system. 

Figure 7 shows a commonly used control and safety 
set up for a small HAWT. The machine is equipped 
with a tail vane. The first function of this tail vane is to 
keep the rotor oriented toward the wind. The tail vane 
of this system, however, is connected with a hinge to the 
body of the wind turbine. The tail vane hinge is tilted 
with respect to the direction of gravity (Fig. 7a). This 
causes the tail vane to prefer a “lowest” position due to 
the weight of the tail vane. For zero and low wind 
speeds the tail vane will be in this position keeping 
the rotor perpendicular to the wind and as such max¬ 
imize the energy production for low wind speeds. 

The rotor shaft, connected to the body of the 
turbine, has an offset with respect to the yaw bearing 
on top of the tower. Thrust on the rotor will initiate 
a yaw moment turning the rotor out of the wind. The 
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Electricity Generation with Small Wind Turbines. Figure 7 

Fortis Montana® control and safety system 


tail vane however prevents yawing. This is possible 
because the hinge of the tail vane has an offset 
and hence it can develop a yawing moment 
counteracting the yaw moment due to thrust on the 
rotor. But when the thrust on the rotor is getting too 
large, the aerodynamic force on the tail vane will cause 
the vane to rotate around its tilted hinge against gravity 
forces. This reduces the aerodynamic force on the tail 
vane and as a consequence, the thrust on the rotor will 
initiate a yawing movement leveling off the rotor 
thrust. 

So in this hinged tail vane set up, the tilted hinged 
vane both has a control function as well as a safety 
function, since very large thrust forces are prevented 
by yawing the rotor out of the wind. 

Apart from the tilted hinged vane other control and 
safety systems are found at small wind turbines. 

Figure 8 shows two examples of safety systems 
activated by over speeding of the rotor. The ESI® 
wind turbine is equipped with tip brakes which are 
kept in place by springs in normal operation. Another 
commonly applied system uses centrifugal governors 
causing the blades to pitch toward feather when the 
rotational speed becomes too high. 

Small wind turbines equipped with a permanent 
magnet direct-driven generator also may have a safety 
system through applying short circuit in the electric 
windings, causing the machine to stop. 


This is a safety system that is also often applied in 
VAWTs since yawing the machine out of the wind 
and/or pitching the rotor blades are usually not an 
option in vertical axis designs. 

Noise and Vibrations 

For small wind turbines noise and vibration can be 
critical issues. Usually small wind turbines operate in 
the vicinity of people, which imply that noise levels 
should be within acceptable (legislation) levels. 

Vibrations tend to reduce lifetime, but are a vital 
issue for roof top locations since they might give yield 
to induced noise inside the building. 

Noise Background noise level in rural areas starts at 
25 dB(A) [9] at a 10 min averaged wind speed of 4 m/s 
and increases with wind velocity (see Fig. 9). 

The noise level in a built environment depends on 
local noise production, for instance, traffic. Due to an 
increased human activity noise in the built environ¬ 
ment will be higher compared to rural areas. 

When the wind turbine is modeled as a point source 
with hemispherical spreading, the noise shows a reduc¬ 
tion of 6 dB(A) per doubling of distance to the source 
(Fig. 10). The distance from the source to the location 
of noise perception is important, and the sound pro¬ 
duced by the wind turbine should ideally be at a level 
beneath or at least equal to the background level. 
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Electricity Generation with Small Wind Turbines. Figure 8 

(a) ESI® wind turbine with tip brakes (b) passive pitch system using a centrifugal governor, reducing rotor thrust at high 
rotational speeds (e.g., [22]) 
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Electricity Generation with Small Wind Turbines. Figure 9 

Prevailing background noise level in rural areas at an average 10 min [14] 


An obvious approach to reduce aerodynamic noise 
emission is to lower the rotational speed of the wind 
turbine. Here a big advantage of drag-driven VAWTs 
comes in because of their inherently low rotational 
speed and hence noise production. 

Lift-driven VAWTs have a big disadvantage with 
respect to noise production. Early (large-scale) 
Darrieus machines were notorious noise producers, 
where the cause of the noise production was the 
unsteady aerodynamic loading of the blades during a 


revolution. Mitigation of that noise source has taken 
place in modern small-scale VAWTs either by reducing 
the tip speed significantly (e.g., Fig. 2d) or by intro¬ 
ducing a blade sweep (see Fig. 17c). 

Vibrations Vibrations occurring in structures should 
be minimized since they cause fatigue damage. Proper 
design of the components of a small wind turbine and 
proper choice of tower and foundation should result 
into low vibrations. Crucial in this respect is the tuning 
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Electricity Generation with Small Wind Turbines. Figure 10 

Wind turbine noise level at distance r for a point source noise emission of 70 dB(A) [14] 


E 


of the natural frequencies of the tower. They should be 
outside the rotational frequency/frequencies (and 
frequencies equal to n times the rotational frequency 
where n is the number of blades). 

Compared to large machines, small wind turbines 
operate at a much larger rotational frequency co. 
The reason is that the wind U does not scale with the 
size and since a rotor is designed to operate at a given 
tip speed ratio X where X = ^, the co must be larger 
when the radius R goes down. This is a potential noise 
hazard since this might mean that vibration frequencies 
are getting in the audible range (>20 Hz) and as such 
may contribute to noise production. 

Wind turbines may generate vibrations at frequen¬ 
cies starting at the rotational speed and the higher 
frequencies inco where i is an integer, n is the number 
of blades, and co is the rotational speed of the wind 
turbine. Vibration of the nacelle is often a consequence 
of mass and/or aerodynamic unbalance of the rotor 
and can thus be avoided by proper design and mainte¬ 
nance. But even well-balanced blades moving through 
turbulent structures in the incoming wind may 
cause vibration frequencies as well. Compared to 
large machines, small wind turbines operate at a 
much larger rotational frequency co. 

Lift-driven VAWTs will induce frequencies in the 
building at co (mass unbalance) and 2 inco (aerodynamic 
vibrations) since during one revolution a blade passes 
turbulent structures twice: first at the windward side of 
the VAWT and again at the leeward side of the VAWT. 


Consequences for the operating wind turbines on 
buildings are evidently that operating at the natural 
frequencies of parts of the building, such as floors and 
walls should be avoided. The fundamental frequency of 
the building itself, which can be approximated by 
fe = Y~ h according to Mertens [14], is usually well 
below the operating frequencies of the wind turbine. 

Comparison of Different Types of Wind Turbines 

The comparison of the wind turbines based upon their 
rotor topologies is provided in Table 1 below. The 
different systems are ranked according to their aerody¬ 
namic efficiencies. But this is evidently only one of the 
aspects that have to be taken into account. Auxiliary 
systems such as systems to yaw the machine in and out 
of the wind as well as control and safety systems need to 
be taken into account as well. Furthermore also the 
costs of the total system will play a role in considering 
what may be the best option. 

Costs of Small Wind Turbines 

The costs of small wind turbines are of evident impor¬ 
tance. At first, the production of a given machine should 
always be considered in regard to the investment costs 
and the operational costs, and second, an assessment of 
the real total costs is important for making an economic 
trade-off between the choice of different (renewable) 
energy sources. 
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Electricity Generation with Small Wind Turbines. Table 1 Comparison of different small wind turbine systems based 
upon their rotor topology 


DAWTs (Diffuser- 
Augmented Wind 
Turbines) 

HAWTs (Horizontal Axis 
Wind Turbines) 

Lift-driven VAWTs 
(Vertical Axis Wind 
Turbines) 

Drag-driven VAWTs 
(Vertical Axis Wind 
Turbines) 

Advantages 

Aerodynamically most 
efficient 

Aerodynamically most 
efficient 

Aerodynamically 

efficient 


Wind direction 
insensitive 

Wind direction 
insensitive 

Turbulence insensitive 

No noise 

Well proven 

Easy access to generator 

Easy access to generator 

Most widely used 

Well proven 

Robust 

Disadvantages 

Large and expensive 
duct needed 

Wind direction change 
sensitive 

Needs start-up aid 

Aerodynamically 

inefficient 

Tail vane/yaw system 
needed 

Large amount of rotor 
material 

Does not perform well in 
highly fluctuating winds 


Cost of Energy 

The cost of energy (COE) is calculated with a 
recommended procedure of the International Energy 
Agency (IAE) (e.g., [27]). 


COE = 


CRF • I 
APE 


TOM 

APE 


where: 


COE: Cost of energy [€/kWh] 

I Investment wind turbine costs [€] 

CRF Capital Recovery Factor : yearly interest [%/year], 
depending on interest rate i and economical 
lifetime n: 


CRF = 


»••(! + »•)" 
(i + 0"-i 


APE: annually produced energy [kWh] 

TOM: total yearly operation & maintenance costs [€] 


When an interest rate of 6% per year is assumed as 
well as an economical lifetime of 15 years [28], this 


results in a CRF of 0.103. The investment costs of main 
components of a small wind turbine will be discussed 
below. 

Wind Turbine Costs 

Information of wind turbine component costs has been 
gathered extensively by Van Beveren [4] . Wind turbine 
costs and component costs are determined based upon 
the values in the year 2006 and are valid for wind 
turbines with a nominal power up to 10 kW. Costs 
are given as costs per swept rotor area, since this is the 
best ratio for comparison. Wind turbine costs could 
also be divided by its nominal power P, but since this 
value is not unambiguous, often determined by 
wind turbine manufacturers themselves and seldom 
overestimated, it is not an objective measure. Note the 
vertical logarithmic scale. If wind turbine costs include 
costs for installation, this is indicated by the label. 

The trend lines are shown in the Fig. 11. 

Costs of HAWTs, the most common small wind 
turbine design as function of the swept rotor area, 
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o 

Fortis incl. installation 

X 

Enflo ® 

▲ 

Turby®incl. installation 

o 

Rivertrade 

□ 

Proven 

A 

ARE® 

□ 

Westwind 

■ 

Tulipo ® 


HDT, DCT 

o 

Bergey 

X 

Windwall ® incl. installation 

X 

Swift ® incl. installation 

A 

Provane ® incl. installation 

♦ 

Venturiwind ® 

o 

Eoltec (Sirocco ®) 

■ 

Windside ® 

— 

HAWT 

— 

Savonius 


Electricity Generation with Small Wind Turbines. Figure 11 

Year 2006 wind turbine costs and swept rotor area as specified by various manufacturers 


comply roughly with Chawt = 500 • A 0-9 , where the 
costs of DAWT and lift-driven DAWT can be described 
by Cdawt = 4000 • A 0 6 . The cost data of Windside® 
turbines, a typical drag-driven VAWT, show even higher 
costs and are described by Cs av0 nius = 6000 • A 0 6 . These 
trend lines are also shown in Fig. 11. 

As can be seen from this figure, the standard HAWT 
has by far the lowest cost to area ratio, though there is 
also a large spread in the values. The Proven machines 
turn out to be almost twice as expensive as the Bergey 
machines, while both are respected manufacturers with 
a relatively large production. What can be seen is that 
the small HAWT costs curve is only slightly dependent 
upon swept area (diameter in this case). The other 
types of wind turbines show a different trend. It should, 
however, be realized that the amount of data available 
for these machines is limited, as well as the number of 
produced machines. 

Tower Costs 

A distinction is made between lattice towers, steel 
guyed masts, supported tube towers, and free standing 
tube towers (see Fig. 12 taken from [4]). The costs of 
these towers as shown in the figure are for tower heights 
of 5-37 m. 

The costs of the tubular towers of the lift-driven 
VAWT Turby® deviate substantially from the other 
designs. The most probable reason is that these are 


free-standing towers, where guyed towers are mostly 
used for Darrieus-type machines. Turby® though was 
designed for roof top application where guy wire con¬ 
nection may cause substantial additional installation 
costs, which are avoided in the current design. 

Costs of Foundation and Installation 

Costs of foundation and installation are based upon the 
WINEUR report [28] and further data of the manufac¬ 
turers, a.o Fortis and Proven. Some towers need a 
foundation of concrete, others use a base plate with 
guyed wires attached to separate anchors. Evidently the 
installation and foundation costs depend very much on 
location, type of tower, and ground surface. Turby® has 
designed a special support structure for the installation 
upon a building. Of all cost elements of a small wind 
turbine these are costs which are most probably best 
suited for DIY activities, in order to bring the total 
investment down. A typical estimate for the profes¬ 
sional costs of foundation and installation of small 
wind turbines is between 10% and 20% of the total 
investment. 

Operation and Maintenance Costs 

On top of the investment cost the Operation and main¬ 
tenance (O&M) costs contribute to the total cost of 
energy. In the COE calculation, these are represented 
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Electricity Generation with Small Wind Turbines. Figure 12 

Tower costs as function of tower height 


in the TOM component. Total O&M (TOM) costs 
are an addition of inspection and maintenance costs 
C IM and the cost of replaced parts C RE and thus: 
TOM = C/m + Cre- 

Inspection and Maintenance Visit Costs Most wind 
turbine manufacturers advise to have the wind turbine 
checked at least twice a year. The costs associated are 
low if the wind turbine is inspected by the user. Nor¬ 
mally inspection of the machine can be done within 
1.5 h for an average-sized wind turbine with a swept 
rotor area of 10 m 2 . Larger machines may take more 
time, mainly caused by the extra time and effort to 
access the nacelle at larger heights. 

Replacement Costs The replacement costs are 
estimated with the failure rate of the wind turbine 
parts. These failure rates are based on an assessment 
of Van Beveren [4] covering a period of 10 operational 
years of small wind turbines and values for large-scale 
wind turbines of Van Bussel [3] (Table 2). 

These values indicate that once every 1-3 years 
a part of a wind turbine fails and needs replacement. 
This is significantly better than for large-scale wind 


Electricity Generation with Small Wind Turbines. 
Table 2 Estimate of failure rates of small wind turbine 
parts 


No. 

Wind turbine parts 

Failures/year 

1 

Rotor blade 

0.02-0.03 

IS 

Rotor blade Savonius 

0.001-0.01 

2 

Shaft 

0.001 

3 

Hub 

0.001-0.05 

4 

Bearings 

0.1-0.2 

5 

Generator 

0.02-0.04 

6 

Power regulator 

0.03-0.06 

7 

Inverter 

0.1-0.2 

8 

Frame 

0.002-0.004 

9 

Tail 

0.03-0.06 

10 

Tower 

0.01-0.02 

11 

Foundation 

0.001 

12 

Cables 

0.001 

13 

Anemometer and data logger 

0.030 

14 

Lightning protector 

0.001 
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turbines, maybe because smaller wind turbines allow 
for a larger safety margin in the design. 

Impact of O&M Costs on Energy Production 

Downtime due to inspection, maintenance, and repair 
is not included in the TOM costs, but is found in the 
APE part of the COE calculation. Failure rates and 
inspection times, however, indicate that the availability 
(defined by the ability to produce if wind conditions 
are within the operational window) of small systems 
is larger than for large-scale systems. Since modern 
large-scale wind turbines show values beyond 98% 
downtime, production loss for small systems should 
not be an issue. But this is of course subject to the 
logistics of spare part handling. 

Indications of APE cannot be given, since they 
strongly depend upon the local wind climate. In general 
terms, a kWh production of more than 1,000 kWh/m 2 
per year swept rotor surface is feasible for large-scale 
wind turbines in good wind climates. Small-scale wind 
turbines are usually located at less favorable sites and 
operate at lower heights. Then a production level of 
300 kWh/m 2 per year is considered to be feasible on 
a good location. 

Application of Small Wind Turbines on Rooftops 

A nice existing example of a wind turbine-building 
combination can be found at the Saxion Hogeschool 


in Deventer (NL). Figure 13 shows the building and the 
drag-driven VAWT on top of it. 

The vertical axis wind turbine is used as an exper¬ 
imental setup for students. In the second instance, the 
similar wind turbine provides part of the electrical 
power for the illumination of the staircase. The wind 
turbines used on these buildings are modifications of 
existing products on the market. 

Requirements for Wind Turbines on Rooftops The 

design of such urban wind turbines is driven by 
the following requirements: 

• Good performance in complex winds 

• Safe operation in the urban environment 

• Low noise level 

• Simple, rugged design 

• Minimized maintenance 

• Aesthetic appearance 

With respect to the first point it already becomes 
clear that the design solution will look different for 
application on rooftops of buildings. HAWT-type 
wind turbines make use of a yawing system to track 
variations in wind direction. In urban areas, and defin¬ 
itively on top of buildings, wind direction variations 
can be considerable. HAWTs then have a clear disad¬ 
vantage, since the yaw system is in general not able to 
follow these fast and extensive variations. This may 


E 



Electricity Generation with Small Wind Turbines. Figure 13 

(a) Saxion building, Deventer, the Netherlands; (b) Drag-driven VAWT wind turbine on top of the central stairway 
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result in a continuous misalignment of the wind tur¬ 
bine with the direction of the wind, and an inherent 
loss in efficiency. This is a major reason for introducing 
VAWTs for rooftop application. 

Flows over Rooftops Flows around buildings are 
inherently complex. At first, the topology of the terrain 
in the oncoming direction of the wind determines the 
local wind structure in terms of average wind speed; 
vertical gradient and turbulence intensity. Second, the 
topology of the building itself and of adjacent buildings 
will determine the local structure. 

An example of the complex flow conditions that 
occur around a square building is shown in Fig. 14. 
Here the flow around a square obstacle is visualized. 
A separation bubble on top of the obstacle, starting at 
the leading edge can be observed as well as a closed cell 
vortex on the windward side and a large separation area 
with recirculation at the leeward side. Unfortunately, 
the details of the flow, such as depicted in Fig. 14, are 
very difficult to predict, even with state-of-the-art 
advanced computational (CFD) methods. Yet Mertens 
[15] showed that it is possible to develop a fairly simple 
theoretical model to determine the performance of 
wind turbines located in such complex flows. His 
calculations show that a significant flow acceleration 
effect is present on top of the building just outside the 
separating streamline It is, however, restricted to 
a fairly small area. In this area, the wind will be about 
20% higher than the undisturbed wind speed far in 



Electricity Generation with Small Wind Turbines. 
Figure 14 

Flow over a high-rise building showing velocity increase 
and flow separation [14] 


front of the building [14]. Figure 15 shows the result of 
the model for a high-rise building which is well 
exposed to the incoming wind. 

The results of the calculations of Mertens for a 
two-dimensional building (a building with infinite 
horizontal crosswind dimension) are shown in this 
figure for different values of the roughness height z 0 
of the surface upwind of the building. The quantity b in 
this graph refers to the building depth, which is the 
horizontal size of the building in the direction of the 
incoming wind. Also shown is the result obtained by 
Wilson [21] for a square building. As can be seen there 
is a reasonable agreement between both results, for an 
upwind roughness height of 1 m, certainly for locations 
farther away from the windward edge of the building. 

From further calculations of Mertens [15] it became 
clear that for a wind turbine located at the center of the 
roof a minimum height of 25% of the depth of 
a building is required for the lowest part of the rotor. 
For typical residential apartment buildings or office 
buildings this is in the order of 12-20 m. This means 
minimal tower height of 5 m underneath the rotor. 

Suitable Wind Turbine Dimensions In the case the 
whole wind turbine is located in the higher wind speed 
region the energy increase is equal to the third power of 
1.2. Thus, the aerodynamic power increase is 1.7 com¬ 
pared to a wind turbine in the same parallel flow without 
the building. When the wind turbine is located on the 
right spot on the roof this increase can be achieved for all 
wind directions. It, however, limits the dimensions of the 
wind turbine to something in the order of 20% of the 
characteristic height of a building. For uniform parallel 
oncoming flow over a mid- to high-rise its building 
height is equal to this characteristic height. Thus the 
limitation yields: D < 0.2 h. For mid- to high-rise build¬ 
ings with a characteristic dimension of say 50-100 m, 
the wind turbine diameter will, therefore, be from 10 
up to about 20 m, in order to take full advantage of the 
speedup effect. Wind turbines of such sizes will gener¬ 
ally not require significant modifications of the build¬ 
ing structure, which is evidently an advantage when 
retrofitting buildings with renewable wind power. 

Wind Turbines Developed for Operation on Buildings 

A number of companies have seen the potential for 
wind turbine application on top of existing buildings 
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Electricity Generation with Small Wind Turbines. Figure 15 

Location of the separating streamline for a two-dimensional building (20 m high, 10 m deep) [15] 



Electricity Generation with Small Wind Turbines. Figure 16 

High solidity and predominantly drag-driven VAWTs of various make 


and have anticipated on the coming market with the 
development of dedicated machines. From the 
requirements for small wind turbines operating on 
buildings it is clear that vertical axis machines have 
a preference. Three examples of commercially avail¬ 
able VAWTs for roof top application are shown in 
Fig. 16. 

In similar fashion, examples of lift-driven VAWT 
designs are given in Fig. 17. It can be clearly seen that all 


three designs are derivatives of the Darrieus wind tur¬ 
bine depicted in Fig. 2c. In the DWT design use is made 
of curved blades, just as in the original design. 
The Neoga® adds skewness of the blades, whereas the 
Turby® combines a cylindrical swept area together with 
blade sweep. 

Since Turby® is the most deviating design, when 
compared to the original Darrieus design, it is worth¬ 
while examining this particular design somewhat further. 
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Electricity Generation with Small Wind Turbines. Figure 17 

Lift-driven VAWTs of various make 



H-type Lift-Driven VAWT with Skewed Blades The 

three blades of the Turby® rotate along a cylindrical 
surface. Compared to the classical Darrieus rotor this 
has the advantage that a larger swept area is obtained 
for the same blade length. A disadvantage is the addi¬ 
tion of struts in the center necessary for structural 
reasons. They will contribute to the viscous drag of 
the machine and thus reduce its efficiency. Further¬ 
more, the stress path in the blades due to centrifugal 
loads is no longer aligned with the blade itself. At this 
point it should, however, be noted that the design “tip” 
speed ratio of Turby®, the ratio between circumferen¬ 
tial speed and wind speed, is significantly less than for 
a classical Darrieus rotor (speed ratio X - 3 for Turby® 
where X - 5-7 for classical Darrieus rotors). 

The blades are furthermore swept. This complicates 
the load path through the rotor further, but is applied 
because it has a very favorable effect upon noise pro¬ 
duction. The combination of low X together with blade 
sweep makes it possible to comply with the noise levels 
allowable in the built environment. More details 
regarding the development and the design of Turby® 
can be found in Van Bussel et al. [5]. The Quit Revo¬ 
lution lift-driven VAWT is another implementation of 
the above described design philosophy (see Fig. 18). 

The fact that these machines are not self-starting is 
often seen as a major disadvantage. However, when 
located on top of building the grid infrastructure is 
present to use the generator of the wind turbine 
temporarily used as a motor for kick-starting the 
machine at sufficient wind speeds. 


Electricity Generation with Small Wind Turbines. 
Figure 18 

Turby® prototype at TU Delft open-air research site 
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Standards and Certification 

Generic design requirements for small wind turbines 
are provided in the standard IEC 61400-2, ed.2, “Wind 
Turbines - Part 2: Design Requirements of Small 
Wind Turbines” provided by the International 
Electrotechnical Committee. 

Local standards and testing procedures for small 
wind turbines are developed by consensus through 
independent third-party authorities, such as the 
Institute of Electrical and Electronic Engineers (IEEE) 
and Underwriters Laboratories (UL) in the USA. Often 
use is made of the above mentioned IEC 61400-2 
standard and of (components of) standards for 
large wind turbines such as IEC 61400-12-1 ed.l, 
“Wind Turbines - Part 12-1: Power Performance Mea¬ 
surements of Electricity Producing Wind Turbines” 
and IEC 61400-11 ed.2, “Wind Turbine Generator 
Systems - Part 11: Acoustic Noise Measurement 
Techniques,” and of other standards applied or modi¬ 
fied for small wind turbines. According to AWEA [1], 
the standards UL 1741 and IEEE 929 have been suc¬ 
cessfully used to certify inverters for small wind 
turbines. 

On top of that the IEEE is working on standard 
IEEE PI547 for intertied “distributed generation” 
technologies including small wind turbines [ 1 ]. 
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Definition of the Subject 

The electrochemical production of hydrogen by photo- 
electrolysis or electrolysis is one of the most promising 
ways for making high-purity hydrogen. Water electrol¬ 
ysis is as old as the field of electrochemistry itself, and 
this entry focuses on some of the fundamental princi¬ 
ples that have been established through the years. 

Introduction 

Hydrogen (H 2 ) is considered by many as the fuel of the 
future. In view of the current interest and importance 
in securing the sustainability of our society’s energy 
needs, many research efforts are invested in making 
hydrogen, storing hydrogen, and converting hydrogen 
back to work. One of the most promising ways to make 
clean “solar hydrogen” is by electrochemistry or elec¬ 
trolysis, either in combination with a photovoltaic cell 
or in an integrated photo-electrochemical device. 
A crucial component of such a device is the catalyst 
converting water (or protons) into hydrogen. Such 
a catalyst must be efficient, stable, robust, cheap (or 
“cost-effective”) and be made from abundant sources. 

This entry will consider some fundamental con¬ 
siderations associated with the catalysis of the electro¬ 
chemical hydrogen production. The entry will focus 


on the energetic and thermodynamic requirements for 
such a catalyst [1-3] as opposed to the more usual 
approach (especially in the field of hydrogen evolution 
by molecular catalysts) in which emphasis is often on 
structural aspects of the catalyst. Of course, in an all- 
encompassing approach or theory, the two must be 
related, but in the author’s view the approach empha¬ 
sizing the energetic relationships provides a very 
useful general and basic framework for later, more 
detailed, approaches. The entry will also discuss 
these more detailed electrochemical models as well 
as the role of the solvent in the hydrogen evolution 
reaction (HER). 

General Theory 

Thermodynamic Theory 

The hydrogen evolution reaction (HER), or the reverse 
hydrogen oxidation reaction (HOR), is a classic exam¬ 
ple of a two-electron transfer reaction 

2H + + 2e~ U H 2 (1) 

The reaction has a standard equilibrium potential 
1±h 2 /h + of 0 V. The mechanism for the hydrogen oxida¬ 
tion/evolution reaction on metals will be considered 
first [4]. Later, this will be extended to a more general 
mechanism that encompasses the framework suggested 
for molecular catalysts. The classical electrochemical 
mechanism consists of three steps: 


H + + * + e u H a( j s 

(la) 

H a ds + H + + e tz H 2 + * 

(lb) 

2H ads Z;H 2 + 2* 

(1c) 


known as the Volmer, Heyrovsky, and Tafel steps, 
respectively. The symbol * stands for a free adsorption 
site on the metal electrode surface, but it may also be 
considered as a coordination site for hydrogen in a 
molecular catalyst. The H ads intermediate is a hydrogen 
species chemisorbed to such a site on the surface or in 
the molecular catalyst. Note that in reactions la and lb, 
the proton and electron transfer are assumed to take 
place simultaneously. The concerted character of the 
proton-electron transfer for the Volmer reaction on 
Pt( 111) is supported by recent detailed ab initio density 
functional theory (DFT) calculations [5, 6]. 
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The above multistep mechanism needs to satisfy 
thermodynamic constraints in the sense that the com¬ 
bined equilibrium constants need to conform to the 
Nernst equation of the overall reaction [7]. Let us 
assume that the reaction follows a Volmer-Heyrovsky 
mechanism, in which case: 


4[ + /H ads + 4 

ads 5 

The equilibrium potential of a redox reaction 
involving an adsorbed or chemisorbed state can be 
calculated if one defines the concomitant standard 
state. Consider the Volmer reaction la, for which the 
equilibrium condition is 

AG(H ads ) = A G(H+ + e~) (3) 


Following Norskov et al. [8], the term on the right- 
hand side is 0 at E = 0 (vs NHE) by definition, and — e 0 E 
at any other potential E , assuming that when the poten¬ 
tial is changed only the energy of the electrons changes, 
and nothing else. The latter assumption may be tested 
by DFT calculations, showing that the binding energy 
of H ads does not depend strongly on potential, or on 
an applied electric field [9-12]. This implies that the 
static dipole moment of the H ads bond to the surface 
is close to 0, and that the electrosorption valency 
corresponding to reaction la is close to 1 [12]. There¬ 
fore, the standard equilibrium potential for the Volmer 
reaction follows as: 


4+/H ads 


AG°(H ads ) 

0 ) 


( 4 ) 


where AG°(H ads ) is the free adsorption energy of H ads 
at some suitably defined standard state “0” (for 
instance, one-fourth coverage on a (111) surface), 
with respect to the energy of H 2 , which is equal to 
0 also by virtue of the above definition. If AG°(H ads ) 
< 0, i.e., H ads formation onto the catalyst surface yields 
a thermodynamically more favorable situation than H 2 
evolution, f^+/ H d > 0. A well-known example of this 
is hydrogen underpotential deposition (UPD) on plat¬ 
inum and other transition-metal electrodes. Note, 
however, that in terms of the overall reaction, such a 
situation implies 


4r/H lds - 4 ads ,H+/H 2 < 0 ( 5 ) 


so that the Heyrovsky reaction would have a standard 
equilibrium potential (defined for the same standard 
state) below 0 V. Neglecting kinetic factors, this analysis 
would suggest that if in the HER, H ads formation would 
be a thermodynamically favorable reaction at E = 0, 
molecular hydrogen formation as in the Heyrovsky 
reaction must be thermodynamically unfavorable. Fol¬ 
lowing Norskov et al. [8], one could argue that in such 
a case, the minimal “thermodynamic overvoltage” for 
the overall reaction, rj T , would be equal to the standard 
equilibrium potential of the thermodynamically least 
favorable reaction, i.e., rj T = E^+ Ud /u 2 — ^h+/h 2 ' This 
“thermodynamically least favorable reaction step” is 
termed the potential-determining step (not to be con¬ 
fused with the rate-determining step, though often they 
may correspond to the same reaction step). Since the 
lowest possible “thermodynamic overvoltage” is 0, it 
follows that, on the basis of these thermodynamic con¬ 
siderations, we are looking for a catalyst with: 

Vt = 4+/H ads = 4a*,H+/H 2 = “ - = 0 ( 6 ) 

Any catalyst with AG°(H ads ) ^ 0 will have a rj T > 0 
as either the Volmer reaction has a standard equilib¬ 
rium potential more negative than 0 (AG°(H ads ) > 0) 
or the Heyrovsky reaction has a standard equilibrium 
potential more negative than 0 (AG°(H ads ) < 0). 
Figure 1 plots r] T for the HER as a function of 
AG°(H ads ), and illustrates how the optimal catalyst is a 
compromise between strong and weak binding of H ads , 
as expressed by the classical Sabatier principle [13]. 
Figure 1 is similar but not identical to a volcano rela¬ 
tionship, in which the catalytic activity for a reaction is 
plotted as a function of some property of the catalyst, in 
this case the catalyst’s ability to bind an intermediate. 
Typically such a plot leads to peak-shaped relationship of 
the catalytic activity versus the catalyst property, whence 
the name “volcano relationship.” Such a catalyst prop¬ 
erty is also known as a “descriptor,” as it gives a general 
but essential description of the catalyst properties. 
Although the “thermodynamic overpotential” of 
multistep electron transfer reactions is not a kinetic 
measure, the concept of the volcano relationship and 
associated Sabatier principle straightforwardly follow 
from thermodynamics combined with some very mild 
kinetic assumptions (one of the assumptions that one 
may make is that of a relation between activation energy 
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Electrochemical Hydrogen Production. Figure 1 

"Thermodynamic" volcano plot for the hydrogen evolution 
reaction {HER) and the hydrogen oxidation reaction ( HOR ): 
thermodynamic overpotential (standard equilibrium 
potential of the potential-determining step) versus the free 
binding energy of hydrogen on the electrode surface 
[1 5, 1 7]. The dashed line gives the same plot for the HOR. 
The potential-determining steps corresponding to the 
various legs of the curves are indicated 

and thermodynamic driving force, i.e., the Bronsted- 
Evans-Polanyi postulate [14]). Therefore, I consider 
Fig. 1 as a thermodynamic version of the kinetic vol¬ 
cano plot (i.e., log z 0 vs AG°(H ads ), with i 0 the standard 
exchange current density), as originally suggested for 
the HER by Parsons [15] and Gerischer [16], and 
recently revived on the basis of DFT- calculated values 
for AG°(H ads ) by Norskov et al. [ 17] . Note that the same 
analysis can be applied to the HOR, leading to the 
dashed-line volcano plot in Fig. 1, illustrating that for 
a reduction reaction we obtain a “A’’-shaped plot, and 
for an oxidation reaction we obtain a “ V’’-shaped plot. 

The same analysis maybe applied to a Volmer-Tafel 
mechanism for the HER/HOR. The only difference 
now is that reaction lc does not have a standard equi¬ 
librium potential, because it is not an electrochemical 
reaction. However, combined with reaction la, it must 
still satisfy the overall thermodynamic constraints of 
reaction 1. Defining the standard equilibrium constant 
of reaction lc as: 


k h^/h 2 = exp 


—AG°(H ads ) 
RT 


( 7 ) 


the Tafel-Volmer mechanism has to obey the following 
relation: 




/H a ds 


AG°(H ads ) 


= 0 


eo 


(8) 


In case F^ H d < 0, the thermodynamic overvolt¬ 
age is rj T = d > as in the Heyrovsky-Volmer 

mechanism. In case E^ + j H d > 0, the Tafel reaction 
becomes endothermic (Eq. 8), and therefore thermo¬ 
dynamically unfavorable. The most straightforward 
definition of a “thermodynamic overvoltage” would 
then follow from Eq. 8, leading to r] T = — E^+/ H d • As 
a result, we see that the Heyrovsky-Volmer and the 
Tafel-Volmer mechanisms lead to “thermodynamic 
volcano curves” that are identical, and that more mech¬ 
anistic assumptions are needed in order to distinguish 
between these two mechanisms. One important conse¬ 
quence of a volcano-type relationship such as that 
illustrated in Fig. 1 is that the potential-determining 
step is different on each side of the maximum. “Poten¬ 
tial determining” means that this step determines the 
(over)potential at which a certain current density may 
be reached. For AG°(H ads ) > 0, the Volmer reaction is 
potential determining; for AG°(H ads ) < 0, either the 
Heyrovsky or Tafel reaction is potential determining. It 
is important to point out the difference between poten¬ 
tial-determining and rate-determining step. Potential¬ 
determining steps are not always rate determining at 
the relevant potentials, as potential-determining steps 
suffer from an unfavorable thermodynamics, but not 
necessarily from an unfavorable kinetics at the poten¬ 
tial at which the current finally starts flowing. Never¬ 
theless, this simple analysis lucidly illustrates the 
thermodynamic constraints applicable to multistep 
multi-electron transfer reactions, and how catalysis 
may influence the “thermodynamic overvoltage” by 
optimizing the binding energy of the key intermediate, 
in this case H ads . The best catalyst, according to this 
analysis, is the one that binds hydrogen in such a way 
that there is no thermodynamic sink or barrier in going 
from reactant through intermediate to product. This is 
the essential point made already by Parsons more than 
50 years ago [15]. Experimental volcano plots of the 
exchange current density of the HER have been 
suggested many years ago by Trasatti, [18] and such 
plots may be criticized for various (experimental) 
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reasons [19, 20]. In this section, we will summarize 
some of the main objections. The section Hydrogen 
Evolution at Metal Electrodes will review the experi¬ 
mental evidence for the existence of a volcano relation¬ 
ship for the HER. At this point, it is stressed again that 
the approach discussed here does not include explicitly 
any kinetic aspects of the reaction. 

The idea recently suggested by Norskov et al. [17] is 
that AG°(H*) may be calculated for many potential 
catalysts from first principles by using DFT in combina¬ 
tion with some mild assumptions on entropic contribu¬ 
tions. In a DFT calculation, one typically calculates the 
binding energy of the hydrogen atom (or radical): 

Catalyst+H* —> Catalyst — H E =Ebi n d (9) 

The free binding energy using H 2 as a reference can 
then be estimated from 

AG°(H ads ) = E bind - T*AS + AZPE + E(H*) 

( 10 ) 

The term T* AS takes into account the entropy con- 
tribution due to the vibration of hydrogen in its adsorbed 
state, and AZPE is the corresponding zero-point energy 
of the adsorbate. Their combined (— T*AS + AZPE) 
value has been estimated for H adsorbed on metals as 
ca. 0.24 eV by Rossmeisl et al. [11]. E(H # ) is half the 
bond energy of the hydrogen molecule, which is 2.24 eV 
(including the molecule’s ZPE). 

Trends in AG°(H ads ) or E bind among different 
metals often conform to the so-called d-band model 
proposed by Hammer and Norskov [14, 21]. This 
model claims a close correspondence between the 
strength of a surface bond (of H, but also of other 
adsorbates such as O, OH, and CO) and the location 
of the center of the d band of the metal surface. The 
energy of the d band may be “engineered” by changing 
the properties of the metal surface, for instance by 
(surface) alloying and overlayer formation [22]. The 
d-band center also influences the activation energy 
for dissociative adsorption, primarily through the 
Bronsted-Evans-Polanyi relationship [14], which 
assumes a linear relation between reaction energy and 
activation energy. The role of the d band in bond 
breaking electron transfer reactions was recently 
modeled by Santos and Schmickler [23], and will be 
discussed briefly in the next section. 


The above mechanism, based on reactions la-lc, is 
rather specific to the HER on metal electrodes. If such 
a mechanism takes place on a molecular catalyst, the 
catalyst typically has two nearby (metal) centers to 
coordinate the hydrogen and adsorbed hydrogen is 
often referred to as “hydride.” The mechanism may be 
termed “homolytic” in this case, akin to the Volmer- 
Tafel mechanism. On many molecular catalysts or 
enzymes, however, the general mechanism tends to be 
different, and may be considered a variant of the Tafel- 
Heyrovsky mechanism, with the difference that the 
proton is not dissolved in the liquid phase, but also 
bound to a nearby “docking” site [ 1 ]. In such mecha¬ 
nisms, it is typically assumed that the release of 
the hydrogen molecule from the active site is an 
activated process (though the activation barrier may 
be small). Such a heterolytic mechanism may be writ¬ 
ten as follows: 


H + +2e-+*i;H c - h 

(11a) 

H++* , ^H+' 

(11 b) 


(11 c) 

H 2)C h H 2 + * + *' 

(1 Id) 


where the two different coordination sites have been 
labeled as * and The two-electron transfer character 
of reaction 11 should be considered as formal; the 
charge distinction between H _ and H + may be exag¬ 
gerated. Note that we have now the binding energies of 
H^ and H^ as the main descriptors for the thermody¬ 
namics of hydrogen evolution, so strictly speaking we 
have two descriptors now. It is not difficult to under¬ 
stand from Eqs. 11 that the properties of this mecha¬ 
nism will depend on the basicity/acidity of the 
coordinating sites * and As a result, the overall 
activity for hydrogen evolution will normally be 
strongly pH dependent. By contrast, hydrogen evolu¬ 
tion on metals, which only involves neutral hydrogen as 
intermediate (at least formally), is typically not 
strongly pH dependent [1, 4]. 

More Detailed Theories and Simulations 

The phenomenological theory of the previous section 
has an old history in the electrochemistry literature, 
and can be given a much more detailed treatment than 
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given here, for which reference is made to the original 
literature [15-18, 20]. More detailed molecular-level 
simulations of the hydrogen evolution (and hydrogen 
oxidation) reaction have been carried out in recent 
years, and it is of interest to discuss the main results 
from those investigations. 

Santos and Schmickler consider that modeling 
hydrogen oxidation or hydrogen evolution fully from 
first principles is currently still too ambitious a task, 
and more general insight can be obtained from taking 
a simpler approach [23-26]. They have formulated 
a simple Hamiltonian which includes two hydrogens 
and the metal, and the electronic interaction between 
them [27] . The interaction with the solvent is treated in 
a spirit similar to the Marcus model. The ground-state 
energy of the model Hamiltonian may be calculated 
(with some approximations) and the interaction 
parameters entering the expression for the ground- 
state energy (strength of electronic interaction between 
hydrogen atoms; strength of electronic interaction of 
the hydrogen with the metal states; metal surface elec¬ 
tronic band structure; strength of interaction of the 
hydrogen with the solvent) may obtained from more 
detailed calculations, such as DFT. Although the effect 
of the simplifications of such an approach are some¬ 
what difficult to estimate, the great advantage of this 
method is that it allows for a more detailed description 
of the HER/HOR without having to resort to extremely 
complicated calculations, which may have their own 
drawbacks. Moreover, the coupling between bond 
breaking and electron transfer can be followed in an 
instructive way. The activation energy in the theory of 
Santos and Schmickler is determined by the interaction 
of the bonding orbital of the hydrogen molecule with 
the d band of the metal catalyst as it passes the Fermi 
level of the metal. At this critical stage, the occupation 
probability of the bonding orbital is about one, so that 
one electron has already been transferred to the metal 
(in terms of the HOR). However, the activation energy 
for bond making or bond breaking does not only 
depend on the center of the d band as also predicted 
by the theory of Hammer and Norskov. There is an 
additional important role of the electronic coupling 
between the hydrogen and the metal surface. The 
strength of the electronic coupling determines the 
broadening and splitting of the orbitals involved in 
the charge transfer reactions, and normally a stronger 


electronic coupling leads to a lowering of the activation 
energy. Electronic coupling strengths typically increase 
toward the lower left-hand corner of the relevant part 
of the periodic table containing the coinage and tran¬ 
sition metals, because of the diffuseness of the metal 
orbitals involved in the interaction increases in that 
direction. Based on their detailed calculations, Santos 
and Schmickler have criticized the use of volcano rela¬ 
tionships in understanding trends in the metal- 
dependent reactivity of the HER/HOR. First of all, for 
most of the metals for which calculations can be 
performed, the data points lie on the ascending branch 
of the volcano curve. Metals that bind hydrogen very 
strongly also tend to bind oxygen very strongly, and 
therefore points on the descending branch of the vol¬ 
cano (increasing hydrogen binding strength) pertain to 
metals covered by an oxide film under aqueous elec¬ 
trochemical conditions. Hence, the experimental 
evidence for a volcano relationship is weak, if any, 
according to Santos and Schmickler. While this argu¬ 
ment may be correct for the “older” volcano curves 
encompassing a wide variety of metals, it may not 
apply to some recent volcano curves (to be discussed 
in the section Hydrogen Evolution at Metal Elec¬ 
trodes). Secondly, the activation energy for hydrogen 
evolution and hydrogen oxidation depends not only on 
the strength of the interaction between hydrogen and 
the metal surface, but also on effects not (directly) 
incorporated into the simple model discussed in the 
previous section, such as the strength of the electronic 
coupling. The weak electronic coupling between nickel 
and hydrogen may explain the rather low activity of 
nickel, whereas the strong electronic coupling between 
rhenium and hydrogen may explain the high HER 
activity of rhenium [28]. Santos and Schmickler have 
argued that these two metals indeed appear to deviate 
from the volcano plot, precisely for the reason men¬ 
tioned. A final point raised by Santos and Schmickler 
that is not explicitly included in the analysis of the 
previous section is that the binding energy of hydrogen 
is coverage dependent and that different metals may 
show their optimum activity at different coverages. As 
a result, Santos and Schmickler refer to the volcano 
curve, as applied to the HER/HOR as a “venerable” but 
effectively “defunct” and “oversimplistic” concept. 

One of the simplifications made in the Santos- 
Schmickler approach to the HER/HOR is the way in 
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which the role of the solvent is included. The atomic 
level modeling of the solvent in the initial stages of 
proton reduction, without resorting to fully ab initio 
methods, was considered by Wilhelm et al. [29, 30]. 
They took the so-called empirical valence bond model 
(EVBM) to describe proton exchange in water to 
include proton transfer to a Pt( 111) electrode surface. 
The model was parameterized on the basis of density 
functional theory calculations. The advantage of this 
approach over fully ab initio simulations (see below) is 
that much longer simulation times can be probed and 
therefore a more meaningful statistics can be obtained. 
This setup allowed the authors to study the dynamics of 
proton transfer to Pt(lll) as a function of the charge 
density on the electrode surface. Figure 2 shows the 
charge dependence of two rate constants obtained from 
their detailed simulations: the inverse time needed for 
the proton to reach the “final” adsorption state, still 
being a proton, and the inverse time needed for 
“adsorption-to-discharge.” Figure 2 indicates that at 
strongly negative surface charge, the proton transfer 
reaction is dominated by the time the proton needs to 
approach the surface, and proton discharge is very fast. 
At less negative surface charge density, the rate for 


establishing an orientational fluctuation suitable for 
proton transfer to the surface is rate determining. 

Otani et al. [5] have carried out a full ab initio 
molecular dynamics simulation of the proton transfer 
reaction on a Pt(lll) surface. This was the first ab 
initio dynamical simulation of an electrode reaction, 
but it should also be emphasized that this was a single¬ 
trajectory simulation, without statistical sampling, 
and that proton transfer was obtained only if exces¬ 
sively high electric fields were applied. Nevertheless, 
expect more of this kind of work in the future, and 
combined with rare-event sampling techniques, this is 
a potentially very powerful method. More recently, 
Santana et al. have used a similar approach to study 
the HOR on Pt(110) [31]. 

An alternative approach to studying the dynamic 
aspects of proton reduction and/or hydrogen evolu¬ 
tion is to compute the potential energy surface for 
such reactions from detailed DFT calculations. 
A potential energy surface is generated at zero tem¬ 
perature, and therefore it does not include any finite 
temperature effects. Such an ab initio treatment of the 
hydrogen evolution reaction has been carried out 
recently by Skulason et al. [6, 32]. They included the 
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Top : Logarithmic representation of the rate for proton discharge {blue, full line) and the proton bulk-to-adsorption rate 
{green, dashed) on Pt(111) (Reproduced with permission from Ref. [30]) 
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effect of the electrode potential by considering various 
concentrations of protons (simulating different 
surface charge densities) within simulation cells of 
varying size, and extrapolating the obtained energies 
to infinite cell size to simulate a potentiostatic exper¬ 
iment. For the Volmer reaction on Pt(111), a barrier of 
ca. 0.5 eV was calculated. The Heyrovsky reaction was 
concluded to be the reaction with the highest barrier at 
0 V, ca. 1.4 eV for the HER, whereas the Tafel reaction 
has a potential-independent barrier of ca. 0.8 eV at 0 V. 
As a result, the Tafel reaction is predicted to be the 
rate-determining step. 

Hydrogen Evolution at Metal Electrodes 

HER at Platinum Electrodes 

As platinum is one of the best and most studied cata¬ 
lysts for the HER/HOR, as well as the one used in fuel 
cells and in many (acid) electrolyzers, it is useful to 
summarize here some of the pertinent experimental 
results. Platinum is generally agreed to be a very active 
catalyst for HER/HOR, and considered to be located 
near the apex of the volcano. Standard exchange cur¬ 
rent densities for the HOR are not easy to obtain 
because in general it is difficult to correct accurately 
for mass transport limitations and double-layer charg¬ 
ing. Values from 10 -3 to 1 A cm -2 have been reported 
for platinum [4, 33, 34] which would roughly corre¬ 
spond to a turnover rate of 0.1-100 s -1 per Pt active 
site, at the standard equilibrium potential. The Pt 
“active site” is not well known, and there has been 
quite some controversy as to whether the HER/HOR 
is structure sensitive on platinum or not. In contrast to 
early studies, [35] the groups of Markovic [33] and 
Conway [34] have found that the Pt(110) surface is 
the most active in catalyzing the HER/HOR, typically 
being an order of magnitude more active than the other 
low-index surfaces. 

In electrochemical experiments, rate-determining 
steps are usually inferred from so-called Tafel slopes, 
which essentially express the overall potential depen¬ 
dence of the reaction rate (i.e., the current) in a certain 
potential range. If the Volmer reaction is rate determin¬ 
ing and its rate obeys the Butler-Volmer model, then 
the potential dependence is equivalent to a Tafel 
slope of lnlO x RT/(ocFE) & 120 mV/dec if a ^ 0.5. If, 


on the other hand, the Heyrovsky reaction is rate 
determining, and the preceding Volmer reaction in 
“quasi-equilibrium,” the potential dependence of 
the reaction is equivalent to a Tafel slope of In 10 x 
RT/[( 1 + ol)FE] « 40 mV/dec if a ^ 0.5. Finally, if the 
Tafel reaction is rate determining, the Tafel slope will be 
lnlO x RT/[2FE] « 30 mV/dec. In these predictions of 
the Tafel slope, saturation effects on the surface are not 
included. 

Markovic et al. [33] have measured Tafel slopes for 
both the HER and the HOR on Pt single-crystal elec¬ 
trodes in acidic media. Tafel slopes also appear to 
depend on the crystal plane; the most active Pt(110) 
surface exhibits the 30 mV/dec Tafel slope well known 
from polycrystalline Pt [34, 36]. This Tafel slope corre¬ 
sponds to a Tafel-Volmer mechanism, i.e., a combina¬ 
tion of reactions la and lc. 

The HER/HOR reaction kinetics is first order in the 
proton concentration over a wide pH range (0-14), 
implying that overpotentials are similar on the RHE 
(reversible hydrogen electrode) scale, in full agreement 
with the idea that proton and electron transfer are 
coupled over the entire pH range. However, there is 
a nontrivial pH effect in the sense that in alkaline 
media, the rate of the reaction tends be about an 
order of magnitude lower than acidic media, at the 
same overpotential [37]. Conway et al. [37] have 
explained this effect by a more difficult abstraction of 
H from H 2 0 than from H 3 0 + . Pt(110) is still the most 
active surface, also in alkaline media, but the Tafel slope 
seems to be different, closer to 60 mV/dec, although no 
clear straight line was obtained. 

An important discussion in the surface electro¬ 
chemistry literature concerns the nature of surface- 
adsorbed hydrogen that acts as a precursor for hydro¬ 
gen evolution in the HER, or for proton formation in 
the HOR [4, 38]. It is well known that at potentials 
above the reversible hydrogen potential, from ca. 0.05 
to 0.3 V versus RHE (i.e., where no hydrogen evolution 
takes place), adsorbed hydrogen is formed on plati¬ 
num. This form of adsorbed hydrogen is referred to 
as “underpotential deposited” (upd) hydrogen [38]. 
The surface coverage of this H upd follows a Frumkin 
isotherm on Pt(lll) and Pt(100) [38-40] with weak 
repulsive interactions between the adsorbed hydrogens. 
The coverages of H upd in acidic and alkaline media on 
Pt(lll) are almost identical at the same overpotential, 
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but again the kinetics of its formation is much slower in 
alkaline media [41]. The adsorbed hydrogen that par¬ 
ticipates in the HER at potentials around 0 V, is usually 
termed “overpotential deposited” (opd) hydrogen. The 
exact relation between H upd and H opd has lead to ample 
discussion and speculation; for an illustrative account, 
see the paper by Conway and Jerkiewicz [42]. Most 
probably, H upd and H opd should primarily be distin¬ 
guished on the basis of the different overall hydrogen 
coverages applicable to both states, with a typical H upd 
coverage being lower than 0.5-1.0, and a typical H opd 
coverage being close to or higher than 1 (i.e., including 
the H upd coverage, implying more than 1 H per surface 
Pt atom), suggesting that H upd and H opd reside at 
different sites on the surface. The different coverages 
lead to different lateral interactions, different energet¬ 
ics, and site preferences, and therefore different activi¬ 
ties toward H 2 evolution. 

There have been a number of attempts to probe the 
adsorbed hydrogen on platinum using vibrational 
spectroscopy. A recent surface-enhanced infrared (IR) 
study on a platinum film electrode by Kunimatsu et al. 
[43] confirmed earlier results by absorption-reflection 
IR [44] that for potentials lower than ca. 0.1 V RH e> 
atop-bonded Pt-H is formed. The signal of this band 
correlated with the hydrogen evolution current, and 
therefore this species was identified as the intermediate 
in the HER. Kunimatsu et al. suggest that H upd adsorbs 
below 0.3 V RH e in hollow sites, though this state has 
not been resolved spectroscopically, and that, in addi¬ 
tion to H upd , H opd adsorbs below 0.1 V RH e at atop sites, 
and serves as the reaction intermediate. 

Experimental Evidence for Volcano Behavior on 
Strained Monolayer and Bimetallic Systems 

The evidence for a volcano-type relationship has been 
collected by Trasatti [18], and more recently by 
Norskov and Stimming and coworkers [ 17] . One prob¬ 
lem with such a correlation is that it is partially based 
on including metals that under experimental condi¬ 
tions are most likely covered by an oxide [20]. Recent 
studies by Kibler et al. [45, 46] have employed the idea 
of using palladium monolayers on different substrates 
[from Pd on Re(0001) to Pd on Au(lll)] to tune the 
reactivity toward hydrogen adsorption and hydrogen 
evolution. Since the potential of the H upd peak 


observed in voltammetry is directly related to the 
adsorption energy of hydrogen (or more accurately, of 
Hupd), the authors could relate the peak potential to the 
theoretically predicted d-band shift with respect to Pd 
(111). The good correlation found suggested that this 
monolayer approach is a viable and controlled way to 
tune reactivity. In a subsequent study, the same systems 
were studied for their HER activity. Figure 3 shows the 
plot of the standard exchange current density versus the 
free energy of adsorption of hydrogen, clearly in agree¬ 
ment with the predictions of the Sabatier principle and 
the volcano plot. Note that from these results, an opti¬ 
mum standard exchange current density of ca. 0.01-0.1 
A cm -2 is predicted. 

Clearly, a catalyst consisting of Pd, Ru, and Pt is 
hardly an economical alternative for Pt, which is 
already quite a good catalyst for the HER/HOR. There¬ 
fore, Greeley et al. [47] have carried out an extensive 
computational study of bimetallic surfaces, screening 
for specific combinations with a AG°(H*) close to 0, 
with a negative formation energy to ensure stability of 
the material, and preferably consisting of cheaper ele¬ 
ments. The counterintuitive combination of Pt and Bi 
emerged as one of the possibly attractive materials, 
with a AG°(H*) closer to zero than Pt. An appropri¬ 
ately prepared Pt-Bi surface alloy was subsequently 
found to indeed have a higher activity for the HER 
than Pt, showing the promise and usefulness of this 
computational screening strategy. 

Hydrogen Evolution at (Electrodes Modified with) 
Molecular Catalysts in Aqueous and Nonaqueous 
Solvents 

There have been ample efforts to develop molecular 
catalysts for hydrogen evolution, the majority of them 
inspired to mimic the activity of natural hydrogenases, 
resulting in an extensive literature on this subject that 
will not be summarized here. For some recent reviews, 
the reader may consult various recent sources [48-51]. 
These biomimetic complexes are usually organometal- 
lic compounds containing Fe, Ni, Co, and/or Ru. It is 
also known that certain nonmetal containing organic 
molecules (such as bipyridine) may enhance the HER 
on metal electrodes otherwise not particularly active 
for the HER, such as mercury or gold (see, e.g., Ref. 
[52] and references therein). 
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Electrochemical Hydrogen Production. Figure 3 

Hydrogen evolution exchange current density as a function of the theoretical differential free energy of hydrogen 
adsorption, AG°(H*). The dashed line shows the functional form of the theoretical exchange current density, and the black 
diamonds correspond to the predictions for specific overlayers. The black circles represent experimental measurements 
(the measured exchange current density determines the ordinate for these points, and the abscissa is specified by the 
theoretical AG°(H*) value). Pd*/X denotes a pure overlayer of Pd on a substrate of metal X (Reproduced with permission 
from Ref. [46]) 


Hydrogenases themselves have also been investigated 
as hydrogen evolution catalysts, and when adsorbed on 
graphitic electrodes, have been shown to exhibit very 
good catalytic properties, comparable to platinum. 
This latter subject has been discussed and reviewed 
extensively by Armstrong and coworkers [53]. To eluci¬ 
date the catalytic function of these enzymes, it is useful 
to discuss some existing DFT calculations on compu¬ 
tational hydrogenase models within the framework of 
the theory discussed in the section General Theory. 

All hydrogenases catalyze both HER and HOR. The 
[FeFe] hydrogenases are most active for hydrogen for¬ 
mation, while [NiFe] hydrogenases are poised for 
hydrogen oxidation. Mechanistic information has 
been obtained from enzyme kinetics, spectroscopy, 
X-ray structural data, and DFT calculations. A general 
feature is that hydrogen binding or dissociation, proton 
exchange, and electron transfer are separate steps. 
A second distinct difference with the metal surface- 
catalyzed reaction is the heterolytic splitting or forma¬ 
tion of hydrogen (reactions 11). A proton is abstracted 


by a nearby base, and hydride is stabilized by the metal 
ions rather than H*, although electron and spin den¬ 
sities are substantially delocalized. This is somewhat 
similar to the Heyrovsky reaction proposed for the 
mechanism on metals (reaction lb). As mentioned, 
on a platinum electrode the homolytic bond splitting 
or making (Tafel reaction lc) seems to be the preferred 
pathway. 

The mechanism of the HER/HOR by [FeFe] 
hydrogenases, as suggested by DFT calculations and 
experimental data, can be summarized as follows: [54] 


[Fe’^e 1 ] -1 +H+ -> [Fe II Fe I H] 

(12a) 

[Fe n Fe : H] + e“ ->• [Fe'Fe'H] -1 

(12&) 

[Fe'Fe’H] -1 + H + -> [Fe’Fe'HH] 

(12c) 

[Fe'Fe'HH] + e“ -> [Fe I Fe°HH]“ 1 

[Fe n Fe '?7 - H 2 ] _1 

(12 d) 

[Fe ll ¥e l r] - H 2 ] _1 -> [Fe'^e 1 ] -1 + H 2 

(12c) 
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The electrons are rapidly transferred from the 
reductant (or the electrode) to the active site via the 
chain of [FeS] clusters. From an electrochemical per¬ 
spective, these [FeS] clusters thus are intramolecular 
mediators. Although Fe-H-Fe bridging intermediates 
have been proposed, the most likely mechanism 
involves one-Fe bound H-species and side-on bound 
H 2 . The secondary amine in the bridging ligand and Fe 1 
appear to be equally potent as catalytic base. The iron 
most distant from the nearby [FeS] cluster, called the 
Fea site that shuttles between the formal 1+ and neutral 
state in reactions 12a-e, appears as the actual binding 
site for H and H 2 . By taking together the proton and 
electron transfer steps in reactions 12a-d, we can com¬ 
pare the DFT calculations performed by Sbraccia et al. 
[54] to the model presented in the section General 
Theory. For reactions 12a and 12b together 

[Fe'^e 1 ] -1 + H + + e“ -> [Fe’Fe'H] -1 (12/) 

the DFT-calculated reaction energy is —15.98 eV. Com¬ 
pared to the H 2 molecule, with a calculated total energy 
of 31.70 eV, the reaction energy is therefore —0.13 eV. 
For reactions 12c and 12d together: 

[Fe'Fe'H] -1 + H+ + e" ->• [Fe I Fe°HH]“ 1 
—> [Fe II Fe I 7y — H 2] -1 

the DFT-calculated reaction energy is —15.43 eV, imply¬ 
ing a reaction energy of +0.42 eV. For the final H 2 
release, reaction 12e, the reaction energy is —0.29 eV 
(with a calculated activation energy of 0.07 eV). Note 
that the three calculated reaction energies now add up 
to 0, implying that we are considering the reaction at 
thermoneutral conditions. Although this is an analysis 
based on total energies, as calculated by DFT, and not 
on free energies, these numbers still suggest that: (1) for 
hydrogenases, the key to an efficient catalyst is also to 
have an active site that binds H just as strong as H 2 (see 
also Ref. [55]), and (2) that the addition of the second 
hydrogen, reaction 12g, is thermodymically uphill and 
hence the potential determining step, if we neglect 
zero-point energy contributions and entropy. 

The organometallic nature of hydrogenase struc¬ 
tural mimics makes them more soluble in nonaqueous 
solvents than in water. This has led to the development 
of activity tests in acetone, propylene carbonate, or 
acetonitrile for evaluation of model compounds. This 


is obviously not without consequences for the defini¬ 
tion of pH and of the equilibrium potential of the redox 
couples. The equilibrium potential of the reference 
couple, £j^ H+ , is indeed strongly dependent on the 
nature of the protic species (solvated protons or weak 
acid). The overpotential for hydrogen evolution, which 
is the relevant experimental quantity when comparing 
different catalysts in different solutions, must therefore 
be referred to the reversible potential of the H + /H 2 
redox couple in the solvent under consideration. Work¬ 
ing in a nonaqueous solvent has two important 
consequences for the determination of the reversible 
potential. First of all, the solvation energy of the proton 
is solvent dependent, and leads to a concomitant shift 
of the standard equilibrium potential. Second, many 
proton donors dissociate only incompletely in organic 
solvents, and therefore the dissociation constant should 
be taken into account in the definition of the standard 
equilibrium potential. Felton et al. [56] have described 
how to correct for both effects properly, and have given 
a useful comparison of various catalysts in a typical 
nonaqueous solvent, i.e., acetonitrile. 

Considering the acid dissociation constant, K * c ^ A , 
in solvent “sol,” of the reaction: 

HA(sol)-H + +A“ (13) 

the standard equilibrium potential of the overall half 
reaction 


2HA(sol) + 2e ^ H 2 (g) + 2A 
in the solvent “sol” is given by 


p0,sol _ p0,sol 

% 2 /HA — % 2 /H+ 


2.303 RT 


P^°ha 


(14) 


(15) 


In nonaqueous solvents, values for or 

i?H 2 /HA are often referred to the ferrocenium/ferrocene 
couple in the same solvent. The value of E^J n+ would 
be a fundamentally more interesting reference, and if 
one is not interested in determining absolute electrode 
potentials, a simple way to look at the solvent depen¬ 
dence of E^ 2 °^ h + > as referred to E^J U+ , is by considering 
the difference in the absolute solvation energies of the 
proton in “sol” and water: 


C 0,sol _ c 0,aq , [// 0l (H + ) - ^(H+)] 

Ai 2 /H+ “ H 2 /H+ + p 


(16) 
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Values for the absolute solvation free energy (i.e., the 
Gibbs free energy of solvation not taking into account the 
surface potential of the solvent) of the proton in various 
solvents have been reconsidered recently by Fawcett [57] 
and the thus calculated values of Is^°^ + ( vs 
given in Table 1. The experimental estimates of £^ H+ 
have also been the subject of much work and debate. 
Fourmond et al. [58] have recently discussed their 
proper determination in a limited number of represen¬ 
tative solvents, based on original data of Daniele et al. 
[59] . These values are, however, quoted versus an internal 
solvent standard, i.e., the ferrocenium/ferrocene Fc +/0 
couple. The equilibrium potential of this couple is 
usually considered solvent independent, although 
water appears to be an exception [60]. 

Using the tabulations provided in Ref. [60], the 
third and fourth columns in Table 1 have been gener¬ 
ated, by first referring to decamethylferrocenium 
(DMFC)/decamethylferrocene couple (third column), 
which appears to be more solvent independent, and 
then rescaling such that in water the reference is at 0 V 
(fourth column). Although there is clearly a lack of data 
in this table to make any far-reaching conclusions, the 
differences between the theoretical and experimental 
values in acetonitrile and dimethylsulfoxide (DMSO) 
are typically within ±0.15 V. More data are obviously 
needed to further elucidate this correspondence. 

To illustrate the influence of a nonaqueous solvent 
on the HER activity, a comparison of different catalysts 
in acetonitrile is given in Figs. 4 and 5 [56]. It is seen 


Electrochemical Hydrogen Production. Table 1 

dependence of the equilibrium potential of the hydrogen 
electrode 


Sol 

versus 

SHE/V 

(From 

Ref- [ 1) 

versus 

^DMFc+/°/V 

(From Refs. 

[ , 1) 

p0,sol 

t H 2 /H+ 

versus. 

SHE/V 
Combining 
[ , i 

Water 

0 

-0.099 

0 

Acetonitrile 

0.414 

0.435 

0.534 

DMSO 

-0.207 

-0.202 

-0.103 

DMF 

- 

-0.162 

-0.063 

Methanol 

0.083 

- 

- 



E/Vvs Fc + /Fc 


Electrochemical Hydrogen Production. Figure 4 

Linear sweep voltammograms of hydrogen evolution for 
10 mM acetic acid in acetonitrile with 0.10 M 
tetrabutylammonium hexafluorophosphate at 1.0 V/s. 
Electrodes as indicated (GC is glassy carbon). Reversible 
potential for reduction of acetic acid shown for comparison 
(Reproduced with permission from Ref. [56]) 



E/V vs. Fc + /Fc 


Electrochemical Hydrogen Production. Figure 5 

Linear sweep voltammograms for 2 mM Fp2, an 
organometallic Fe compound, plus 10 mM acetic acid in 
acetonitrile with 0.10 M tetrabutylammonium 
hexafluorophosphate (solid curve) and 10 mM acetic acid 
alone (dashed curve). Hanging mercury drop electrode (area 
0.0096 cm 2 ). Scan rate: 1.0 V/s (Reproduced with 
permission from Ref. [56]) 
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that Pt is still the best catalyst of the four catalysts 
illustrated in Fig. 4, and also significantly better than 
the hydrogenase mimic in Fig. 5, but the overpotential 
is rather high compared to the situation in an aqueous 
solvent. As a practical compromise to comparing 
overpotentials, it has been suggested to compare half¬ 
wave potentials. A detailed discussion on the proper 
comparison of these half-wave potentials has been 
given recently by Fourmond et al. [58]. 

Concluding Remarks 

This entry has attempted to address some fundamental 
issues with studying the electrochemical production of 
hydrogen under various conditions and with various 
catalysts. The aim was to provide a common theoretical 
framework first, and then to show how to build on this 
framework and to understand the similarities and dif¬ 
ferences between the different catalytic approaches that 
have been used to tackle this important problem. It is 
hoped that this approach may be useful in ultimately 
developing a cheap, robust, and active catalyst for the 
electrochemical production of hydrogen. A main 
future direction for the development of new catalysts 
for electrochemical hydrogen production could be the 
combination of heterogeneous and molecular catalysis. 
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Glossary 

Activated carbon Also called activated charcoal or 
activated coal is a form of carbon that has been 
processed to make it extremely porous and thus to 
have a very large surface area available for EDLC, 
adsorption, or chemical reactions. 

Carbon nanotube Carbon nanotubes (CNTs), not to 
be confused with carbon fiber, are allotropes of 
carbon with a cylindrical nanostructure. Nanotubes 
have been constructed with length-to-diameter 
ratio of up to 132,000,000:1, significantly larger 
than any other material. These cylindrical carbon 
molecules have novel properties, making EDLC 
performances excellent in power capability. 

Electrochemical capacitor (EDLC) An electric dou¬ 
ble-layer capacitor, also known as supercapacitor, 
pseudocapacitor, electric double layer capacitor 
(EDLC), supercapacitor or ultracapacitor is an elec¬ 
trochemical capacitor with relatively high energy 
density. Compared to conventional capacitors the 
energy density is typically on the order of thousands 
of times greater than an electrolytic capacitor. In 
comparison with conventional batteries or fuel 
cells, EDLCs have lower energy density but 
a much higher power density. 

Energy density or Specific energy is defined as the 
energy per unit mass or volume. 

Hybrid (asymmetric) capacitor A hybrid capacitor 
consists of a battery-like (faradic) electrode and 
a capacitor-like (non-faradic) electrode, producing 
higher working voltage and capacitance. With these 
systems, one can certainly achieve twice or triple 
enhancements in energy density compared to the 
conventional EDLCs. 

Lithium-ion capacitor A Lithium-Ion Capacitor 
(LIC) is a hybrid type of capacitor. Activated carbon 
is used as cathode. The anode of the LIC consists of 
carbon material which is pre-doped with lithium 
ion. This pre-doping process lowers the potential of 
the anode and allows a high output voltage and 
higher energy density. 
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Nanohybrid capacitor A new lithium-ion-based 
hybrid capacitor using the lithium titanate 
(Li 4 Ti 5 0 12 ) negative intercalation electrode that 
can operate at unusually high current densities. 
The high-rate Li 4 Ti 5 0 12 negative electrode has 
a unique nano-structure consisting of extremely 
small nano-crystalline Li 4 Ti 5 0 12 nucleated and 
grafted onto carbon nano-fiber anchors (nc- 
Li 4 Ti 5 0 12 /CNF). 

Power density Power density (or volume power density 
or volume specific power ) is the amount of power 
(time rate of energy transfer) per unit volume. 

Definition of the Subject 

An electrochemical capacitor or electric double-layer 
capacitor (EDLC), also known as supercapacitor, 
pseudocapacitor, electrochemical double layer capaci¬ 
tor, or ultracapacitor is an energy storage device with 
high power and relatively high energy density. Com¬ 
pared to conventional electrolytic capacitors the energy 
density is typically 3 orders of magnitude greater. In 
comparison with conventional batteries or fuel cells, 
EDLCs show lower energy density but have a much 
higher power density. EDLCs have a variety of com¬ 
mercial applications, notably in energy-smoothing and 
momentary-load devices. They have applications as 
energy-storage devices used in vehicles, and applica¬ 
tions like wind power solar energy systems where 
extremely fast charging required. 

Historical Background 

GE, General Electric engineers experimenting with 
devices using porous carbon electrodes first observed 
the EDLC effect in 1957. In 1966, SOHIO Standard Oil 
of Ohio developed the modern version of the devices, 
utilizing two layers of activated charcoal separated by 
a thin porous insulator. This basic cell design remains 
the basis of EDLCs. SOHIO failed to commercialize 
their device, licensing the technology to NEC, who 
finally marketed “supercapacitors” in 1978, as back¬ 
up of a computer memory. Around the mid-1990s 
various advances in materials science and refinement 
of the existing systems led to rapidly improving per¬ 
formance and an equally rapid reduction in cost. The 
first industrial applications were carried out for 
supporting the energy supply to robots. 


In 2005, aerospace company, Diehl Luftfahrt 
Elektronik GmbH adopted EDLCs to power emer¬ 
gency actuation systems for doors and evacuation 
slides in airliners, including the new Airbus 380 
jumbo jet. As of 2007 all solid state micrometer-scale 
electric double-layer capacitors based on advanced 
superionic conductors had been developed for low- 
voltage electronics. 

Introduction 

Environmental destruction are getting more and more 
serious in the global scale. It is urgent for human 
beings to cope with environmental and energy prob¬ 
lems. The national development of each country 
greatly depends on economic activities oriented to 
environmental improvement business. Inevitably, in 
the future, it will be necessary to build an economic 
society taking into consideration the global ecology, 
that is, a society in which economic development is 
closely and directly related with eco-friendly activities 
such as carbon-reducing and energy-saving move¬ 
ments. Energy storage devices such as batteries and 
electrochemical capacitors hold an important key to 
solve the problems. The energy storage devices are 
some of the best fruits of our environmental technol¬ 
ogies, and they have large potential for promoting our 
international contribution. Specifically, electrochem¬ 
ical capacitors are green devices having such excellent 
characteristics that they have long lifetimes and can be 
rapidly charged and discharged [7, 28, 31]. In partic¬ 
ular, when combined with renewable energy plants 
(wind power, wave power and solar cell), they remark¬ 
ably show their advantage and effectiveness [6]. In 
view of that, the electrochemical capacitors have 
been vigorously researched and developed in these 
days, and are expected to improve their energy densi¬ 
ties. Thus, the technology is regarded as a promising 
art for storing electricity, and hence is expanding its 
possibilities. 

Electrochemical Capacitor and Battery 

Batteries store and deliver energy on the basis of chem¬ 
ical reactions, and hence their capacities depend on the 
redox reactions. In contrast, the energy storage or the 
charging-discharging mechanism by electrochemical 
capacitors are based on physical adsorption/desorption 
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behaviors of ions at the interface of electrode/ 
electrolyte, namely, electric double-layer [3, 6, 7, 28, 
31]. Accordingly, the electrochemical capacitors are 
often called “electric double-layer capacitor (EDLC),” 
and more recently they are called “Supercapacitor” or 
“Ultracapacitor” as they increase their performances 
like six characteristics described below: 

1. High power densities capable of instantly charging 
and discharging. 

2. Long lifetimes capable of more than 100,000 
charge-discharge cycles. Free from maintenance 
capable of using for a long time until abandoned. 

3. Excellent safety and reliability issues. Low heat gen¬ 
eration during operation. Recoverable even after 
short-circuiting. 

4. Wide operational temperature ranges. More than 
10°C lower and 20° C higher temperature limits as 
compared with any battery systems. 

5. No fear of material shortage. Activated carbon 
materials can be obtained from any organic 
substance. 

6. Free of heavy metals and halogens and Low envi¬ 
ronmental load. 

As described above, the electrochemical capacitors 
have various advantages such as high power, high 
charge-discharge cycleabilities, high safety, and favor¬ 
able working temperature ranges. The above character¬ 
istics are remarkably contrastive with those of lithium 


ion batteries as shown in Fig. 1. The other cannot 
replace one of these devices. The contrasting two 
types of energy devices as represented by lithium ion 
batteries and electrochemical capacitors can be com¬ 
pared to two different energy-providing systems in 
a human body, i.e., aerobic reaction system and anaer¬ 
obic reaction system. It is very interesting to see 
a human body from the viewpoint of its energy system. 
In a human body, ATP (adenosine triphosphate) effi¬ 
ciently takes charge of energy production and supply. 
The ATP is mainly produced by way of two routes 
(i.e., aerobic reactions and anaerobic reactions). The 
aerobic reactions, which proceed with oxygen obtained 
by breathing, slowly produce no less than 38 ATP 
molecules (from one glucose molecule), to provide 
large energy. On the other hand, the anaerobic reac¬ 
tions can rapidly produce two ATP molecules, to pro¬ 
vide small but instant energy according to necessity. 
The combination of these systems realizes such smooth 
energy supply as can correspond to any movement of 
human body. In the same way, the combinations of 
lithium ion batteries and capacitors give very high 
synergy effects. Consequently, energy management 
can be conducted under any situation by use of 
the two contrasting energy devices in exquisite 
combination. 

In many uses, lithium ion batteries and capacitors 
play complementary roles. Since the synergy effects 
obtained by their combinations are very high, practical 
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examples thereof are increasing in various fields. Fur¬ 
ther, the capacitors may be combined with devices 
other than the lithium ion batteries, for example, 
with fuel batteries or motor/inverters. At present, 
researchers pay their attention to smart systems 
employing the combinations of the capacitors and 
those devices. In consideration of necessity for effective 
and eco-friendly energy measures, it will be one of the 
large strategic aims to develop the composite energy 
storage systems. 

Operation Voltage of Electrochemical Capacitors 

However, since the EDLCs generally have low energy 
densities, their uses are limited and they cannot fully 
meet various performance demands required by the 
recent markets as shown in Fig. 2. Particularly in the 


field of automobiles, new energy devices are strongly 
desired to have hybrid characteristics between lithium 
ion batteries and EDLCs and thereby which can be 
suitably employed in idle reduction systems. Accord¬ 
ingly, it is expected for them to form a large market 
[28]. In order to satisfy the performance demands, it is 
often said to be necessary for the EDLCs to enhance 
their energy density to 20-30 Wh L -1 , which is approx¬ 
imately twice or more than present EDLCs, namely, 
5-10 Wh L -1 . For realizing the above high energy 
density, hybrid capacitor systems comprising non- 
aqueous redox materials are being dynamically 
researched and developed in recent years [5, 6, 10, 16, 
23, 31]. The present chapter deals with the recent 
contributions to get this high energy density by focus¬ 
ing on two major hybridized cell configurations in 
organic media. 
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Target value of the next-generation electrochemical capacitor 
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As is described above, increasing energy density is 
one of the most crucial matters. For conventional 
EDLC systems, designed with two symmetrical acti¬ 
vated carbon electrodes, increasing voltage is more 
effective because the energy density increases in pro¬ 
portion to the squared voltage. Thus, it is essential to 
develop higher electrochemical durability at the elec¬ 
trode/electrolyte interface. Currently, the maximum 
voltage of an activated carbon-based EDLC (AC/AC) 
is limited to 2.5-2.7 V in Propylene carbonate (PC) and 
Acetonitrile (AN) electrolytes. At applied voltage over 
2.7 V, there occurs a significant decrease in capacitance 
and an increase in internal resistance. In fact the float 
test revealed that the AC/AC capacitor lose its capaci¬ 
tance (AC < —13% after 30 days) at 2.5 V, whereas at 
2.9 V the capacitance loss became more significant 
(AC = —28%) for the same duration. The threshold 
cell voltages between 2.5 and 2.9 V certainly triggers 
a consecutive and fatal degradation. The undesired 
faradic process (failure modes) that leads to a 
capacitance fade is the most critical factor that deter¬ 
mines the life of conventional EDLCs and needs to be 
further investigated as in the recent review article by 
Simon et al. [28]. Anyway this withstanding-voltage 
limitation (2.5-2.7 V) certainly exists because of a gas 
evolution at higher voltages and is a barrier for further 
enhancement of the energy density of the conventional 
activated-carbon-based EDLCs. 

Attempts to Increase Energy Density of 
Capacitors 

There is presently a major effort to increase the energy 
density of EDLCs up to a target value in the vicinity of 
20-30 Wh kg -1 [7]. Many studies have been under¬ 
taken in order to enhance the energy density (E) of 
EDLC. There are mainly three approaches (Fig. 3): the 
first is to change the electrode (by higher-capacitance 
carbons or redox) and the second is to change electro¬ 
lyte (by durable new electrolyte or ionic liquids). 

© High working voltage(V) 

© High double layer capacitance(C) 

c _ i /o (r V 2x f c : Capacitance (F) 

t — l/t(CV ) I v : Working voltage(V) I 


Hybrid or Asymmetric Capacitor Systems 

The third and the most important approach to meet 
the goal that is under serious investigation is to develop 
asymmetric (hybrid) capacitors. As shown in Fig. 3 
various hybrid capacitor systems are possible by cou¬ 
pling redox-active materials (e.g., graphite [10, 31], 
metal oxides [5, 14], conducting polymers [17, 18]), 
and an activated carbon (AC). There are suggested two 
different systems, one is aqueous and the other is non- 
aqueous. Some of them are listed below in Fig. 4. These 
approaches can overcome the energy density limitation 
of the conventional EDLCs because it employs a hybrid 
system of a battery-like (faradic) electrode and a capac¬ 
itor-like (non-faradic) electrode, producing higher 
working voltage and capacitance. With these systems, 
one can certainly achieve twice or triple enhancements 
in energy density compared to the conventional 
EDLCs. 

FDK corporation proposes a “Dual Carbon Cell,” 
which is a device behaving “like a capacitor” on the 
basis of faradic reactions on the electrodes. When the 
Dual Carbon Cell is charged, anions and cations (such 
as lithium ions) are intercalated into the positive 
graphite electrode and into the negative graphite elec¬ 
trode, respectively. As a result, an acceptor type graph¬ 
ite intercalation compound is formed on the positive 
electrode while a donor type one is formed on the 
negative electrode. Thus, the Dual Carbon Cell is 
charged or discharged like a capacitor is. It is reported 
that, if the positive electrode is made of graphite doped 
with boron, the Dual Carbon Cell can have improved 
load characteristics. 

Meanwhile, many other hybrid capacitors are also 
proposed each of which utilizes a charge transfer reac¬ 
tion (faradic reaction) on either positive or negative 
electrode whereby one could have a good scenario 
attaining higher voltage and higher energy density. 
Figure 5 represents the voltage profiles of the conven¬ 
tional EDLC system (upper), Nanogate (middle), and 
LIC (lower). Examples of them include a “nanogate” 
capacitor or a “nanostorage” capacitor (Fig. 5, middle), 
in either of which the negative electrode is an activated 
carbon but the positive electrode is an amorphous car¬ 
bon capable of undergoing an anion-intercalation 
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Energy density enhancement of electrochemical capacitors 
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Aqueous and non-aqueous hybrid capacitor systems 


reaction at a high potential (not lower than 4.5 V vs. Li/ 
Li + ), while a normal capacitor comprises both positive 
and negative electrodes made of activated carbon 
(Fig. 5, upper). 

Lithium-Ion Capacitor (LIC) 

Among high-energy hybrid capacitors comprising 
non-aqueous redox materials, a hybrid system called 
“lithium-ion capacitor (abbreviated as LIC)” has 
particularly attracted the attention in these days 
[10, 31]. The LIC is a hybrid capacitor in which the 
positive and negative electrodes are made of activated 
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Voltage profiles of a conventional EDLC, Nanogate, and 
LIC systems 


carbon and of graphite pre-doped with lithium ions, 
respectively (Fig. 6). 

Since the negative electrode of graphite undergoes 
the reaction at a potential a little over 0 V vs. Li/Li + , 
the LIC has a high working voltage of 3.8 V to 4.0 V. 
This high working voltage enables the lithium-ion 


capacitor to realize both a high power density of 
approximately 5 kW kg -1 and an energy density of 
approximately 20-30 Wh kg -1 . The LIC thus exhibits 
favorable performance, and is regarded as a candidate 
of the next generation electrochemical capacitor. 
Hence, some Japanese companies including JM energy, 
FDK, ACT, etc. have already started commercializing 
LIC [10]. 

Generally speaking, instead of achieving energy 
density improvement, the high working voltage causes 
an electrolyte decomposition problem especially at 
negative graphite electrode [21] (Fig. 7). The limited 
charging rate and the low temperature performance are 
the possible drawbacks of the LIC because it has 
a lithium-intercalation negative electrode. The process 
of pre-lithiation of the graphite electrode may lead to 
poor cost-effectiveness or instability in the quality 
of the LIC device when they are mass-produced 
(see Fig. 8). This “pre-doing of Li + ” may bring about 
the same risks relevant to a long-term stability to keep 
low ESR. This is an issue that is specifically important 
for an electrochemical capacitor as a “power device” 
because it lead to create a high impedance electrode/ 
electrolyte interface and thus eventually leads to 
a deteriorated power performance for longer cycles. 

Nanohybrid Capacitor (NHC) 

Very recently, Naoi’s group and Nippon Chemi-con 
developed a hybrid capacitor system that certainly 
achieves a high energy density, high stability, and high 
safety. This is called “Nanohybrid Capacitor” (abbrevi¬ 
ated as NHC) using a super-high-rate nano-structured 
lithium titanate (Li 4 Ti 5 0 12 )/carbon composite nega¬ 
tive electrode. They have kept their eyes on lithium 
titanate (Li 4 Ti 5 0 12 ) as a stable and safe redox material 
capable of increasing the energy densities of hybrid 
capacitors without sacrificing interfacial characteris¬ 
tics. The Li 4 Ti 5 0 12 operates at a potential (1.55 V vs 
Li/Li + ) out of the range where the electrolyte solution 
may be decomposed, and hence play key roles for 
providing capacitor systems as stable and safe as 
EDLCs (see Fig. 9). The Li 4 Ti 5 0 12 as a redox material 
for hybrid capacitors has the following essential advan¬ 
tages in energy density, in stability, and in safety: 

1. It exhibits a high coulombic efficiency close to 

100% during charge-discharge cycle 
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Cell configurations of Li-ion Battery (LIB), EDLC, and LIC 
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Figure 7 

Reliability and electrolyte decomposition at positive and 
negative electrodes of LIC systems 

2. It has a theoretical capacity (175 mAh g -1 ) four 
times higher than as activated carbon 

3. It undergoes charging and discharging at a constant 
potential of 1.55 V vs. Li/Li + , where the electrolyte 


solution is free from the fear of decomposition 
(no SEI formation and no gas evolution) [20, 27] 

4. As shown in Fig. 10, it changes the volume to a very 
small degree (0.2%) in charging or discharging in 
intercalating or deintercalating Li + ions (zero-strain 
insertion) 

5. Inexpensive raw material 

As summarized in Table 1, a capacitor cell compris¬ 
ing Li 4 Ti 5 0 12 is, therefore, expected to be free from 
necessity for pre-doping of lithium ions (because of 
the above advantage (1)), to be capable of having a high 
energy density (because of the advantage (2)), and to be 
excellent in stability and in safety (because of the 
advantages (3) and (4)). 

Amatucci et al. firstly introduced the Li 4 Ti 5 0 12 /AC 
system as safer hybrid systems [1, 2]. However, the 
conventional Li 4 Ti 5 0 12 has the greatest problem of 
low power characteristics that stem from inherent poor 
Li + diffusion coefficient (<10 -6 cm 2 3 s -1 ) [29] and 
poor electronic conductivity (<10 _13 Q _1 cm _1 ) [8]. 
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Figure 8 

Necessary pre-doing of Li + for LIC systems 


Accordingly Li 4 Ti 5 0 12 had a big problem of slow 
output characteristics and was not fully developed for 
the application in electrochemical capacitors at 
that time. For the purpose of solving the problem of 
poor output performance, some measures can be 
thought of. For example, the Li 4 Ti 5 0 12 may be ground 
down to nano-sized particles and may be combined 
with an electro-conductive material to prepare a 
composite. 

In the nano-sized Li 4 Ti 5 0 12 particles, it is expected 
that Li + ions diffuse and electrons migrate in distances 
reduced by 1/1,000 (see Figs. 11 and 12) or less as 
compared with the distances in normal micron-order 
Li 4 Ti 5 0 12 particles. Further, in the composite, electron 
paths are effectively formed in the electrode of 
nano-sized Li 4 Ti 5 0 12 particles (Fig. 13). On the basis 
of these expectations, a composite (nc-Li 4 Ti 5 0 12 /CNF) 
has been synthesized by means of a new method 
referred to as “Ultracentrifugal force (UC) nanohybri¬ 
dization method” [19, 24]. Specifically, the com¬ 
posite (nc-Li 4 Ti 5 0 12 /CNF) comprises nano-crystalline 
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Figure 9 

Voltage profile of conventional EDLC, LIC, and NHC 


Li 4 Ti 5 0 12 particles hyper-dispersed on carbon nano¬ 
fibers (CNFs) [13] having high electro-conductivity 
(25 Q _1 -cm _1 ). They utilized the composite as the 
negative electrode active material, and thereby 
succeeded in producing a novel hybrid capacitor 
(nanohybrid capacitor) realizing both high power and 
high energy density. 

The nc-Li 4 Ti 5 0 12 negative electrode was developed 
to have a unique nano-structure that can operate at 
unusually high current densities. Nano-crystalline 
Li 4 Ti 5 Oi 2 attached onto carbon nano-fibers were 
prepared by a unique technique (UC method) of a 
mechano-chemical sol-gel reaction under ultracentrif¬ 
ugal force field [13, 19, 24], followed by an instanta¬ 
neous heat treatment for very short duration (see 
Fig. 14a). The UC method is induced to form, anchor, 
and graft the nano-Li-Ti-O precursors on the CNF 
matrices. The subsequent instantaneous heat treat¬ 
ment is of prime importance to precisely achieve all 
of the following: high crystallization of Li 4 Ti 5 0 12 , [13] 
inhibition of the CNF oxidation decomposition dur¬ 
ing the annealing at high temperature, [31] and sup¬ 
pression of the agglomeration of the Li 4 Ti 5 0 12 
particles. These processes are quite simple and require 
only a few minutes. Actually, the power characteristic 
of the prepared composite (nc-Li 4 Ti 5 Oi 2 /CNF) made 
a new bench mark (charge-discharge time = 12 s) 
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Structural and volumetric changes at the charged and discharged states of Li 4 Ti 5 0 12 
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[4, 11,12,15, 24, 32] which exceeds greatly the shortest 
charge-discharge time that has ever been attained (see 
Fig. 14b). 

XRD analysis was performed to confirm the forma¬ 
tion of the nc-Li 4 Ti 5 0 12 and the presence of the CNF in 


the nc-Li 4 Ti 5 0 12 /CNF composites. Figure 15a shows 
the XRD patterns of the prepared nc-Li 4 Ti 5 0 12 /CNF 
composite and pristine CNF. The composite has several 
sharp diffraction peaks at 2 6 = 18, 35, 42, 57, and 63°. 
These peaks correspond to (111), (311), (400), (511), 
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Increased accessibility and surface area as a function of the particle size of Li 4 Ti 5 0 12 
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Decreased diffusion length as a function of the particle size of Li 4 Ti 5 0 12 
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Hyper-dispersion of the nano-size particles of Li 4 Ti 5 0 12 


and (440) planes of a face-centered cubic spinel struc¬ 
ture with Fd3m space group [22, 30], respectively, 
indicative of the formation of crystalline Li 4 Ti 5 0 12 . 
Annealing at 900°C under vacuum well crystallizes 
the nano-particles of Li 4 Ti 5 0 12 . A broad peak 
at around 2 9 = 24.5° is observed which corresponds 
to the (002) plane of the pristine CNF [33]. This means 
that the CNF exists in the annealed composite and 
preserves its graphene layer structure. The fact that 
there are no other peaks observed corresponding to 
some possible impurities such as Ti0 2 , Li 2 C0 3 , and 
Li 2 Ti0 3 [9, 26] suggests that there are only two species, 
crystalline Li 4 Ti 5 0 12 and CNF. 

Thermogravimetric (TG) measurement of the nc- 
Li 4 Ti 5 0 12 /CNF composite was performed under air to 
estimate the residual weight ratio of the CNF. The 
obtained TG curve is shown in Fig. 15b. The weight 
loss at 400-600°C resulted by the oxidative decompo¬ 
sition of the CNF, and exactly 50 wt.% of the - 
nc-Li 4 Ti 5 Oi 2 remained. The 50 wt.% is well 
consistent with the weight ratio of the Li 4 Ti 5 0 12 to 
the CNF calculated on the basis of dosed Ti alkoxide 
weight before UC method. This fact implies that sol-gel 
reaction in the UC method stochiometrically proceeds, 
and the optimized (very short duration) annealing 
does not cause oxidative decomposition of the 
CNF. Such a stoichiometric preparation process (UC 
method and instantaneous annealing) is one of the 
important factors for cost-effectiveness of capacitor 
production. 


The nano-structures and crystallinity of nc- 
Li 4 Ti 5 0 12 and CNF in the composite is observed by 
HR-TEM (see Fig. 16). The image indicates that the 
edge or defect sites of CNF graphenes accommodate 
and graft nc-Li 4 Ti 5 0 12 particles. The results support 
that the Li 4 Ti 5 0 12 particles are formed through 
a nucleation process onto the edge and defect CNF 
sites. The clear facet of Li 4 Ti 5 0 12 reflects the high 
crystallinity that is consistent with the sharp XRD 
spectrum (see Fig. 16a), despite such a nanosize. 
Such a high crystallinity resulted in a reversible, 
smooth Li + insertion performance, and ca. 100% of 
coulombic efficiency. Also, HR-TEM images of CNF 
show clear graphene layers indicative of crystalline 
structure. Thus, this composite is considered to 
be the junction material of two crystalline species, 
Li 4 Ti 5 0 12 and CNF. Of particular interest is that 
the lattice matching of the nc-Li 4 Ti 5 0 12 particles, 
and CNFs are perfect and they are firmly tied at the 
atomic level. This could bring about an establishment 
of good electronic paths between the two species 
(Fig. 16b). 

Another closer view of an HR-TEM (see Figure 17) 
indicates the multiple attachments of the nc-Li 4 Ti 5 0 12 
particles on the inside wall of the CNF graphene sub¬ 
strate as well as outside wall of the CNF matrices. This 
may indicate that the crystallization would be such 
efficient on CNF and more importantly one can 
enhance both the energy density and the specific grav¬ 
ity of the nc-Li 4 Ti 5 0 12 . 

Figure 18a shows the charge-discharge curve of the 
half-cell Li/(nc-Li 4 Ti 5 0 12 /CNF) at 1 C. The horizontal 
axis represents specific capacity per unit weight of 
Li 4 Ti 5 0 12 . The dominant plateau was observed at ca. 
1.5 V versus Li/Li + that corresponds to the Li + interca¬ 
lation-deintercalation process of the crystalline 
Li 4 Ti 5 0 12 [13, 24, 30], indicating that the capacity 
of the composite is determined by the redox capacity 
of nc-Li 4 Ti 5 0 12 in the composite. The obtained capac¬ 
ity was 167 mAh g -1 per Li 4 Ti 5 0 12 , which is 95% 
of the theoretical capacity. It is noted here that the 
value of 167 mAh g -1 is obtained after subtracting 
the double-layer capacity of the CNF (8 mAh g _1 ). 
This result indicates that almost all of the nc-Li 4 Ti 5 0 12 
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Nano-structure model and high power performance of super-high-rate nano-crystalline Li 4 Ti 5 0 12 nested and grafted 
onto carbon nanofiber, (a) Schematic illustration for the two-step formation procedure of the nc-Li 4 Ti 5 0 12 /CNF composite, 
(b) Maximum C rate values of the nc-LLffisO^/CNF composite and various Li 4 Ti 5 O n 2 materials in literatures reported so far 


particles in the composite are electrochemically active, 
meaning that ionic and electric paths are fully 
established in the composite. The rate capability of 
the obtained composite is shown in Fig. 18b. 

Even at a high rate of 300 C, the composite shows 
reversible capacity of 158 mAh g -1 per Li 4 Ti 5 0 12 which 


correspond to 95% of the capacity obtained at 1 C. Such 
an excellent rate-capability indicates that the optimized 
nano-structure of the nc-Li 4 Ti 5 0 12 /CNF composites as 
observed in HR-TEM image well overcome the inher¬ 
ent problems of Li4Ti 5 0 12 materials like poor Li + dif- 
fusivity and poor electronic conductivity. Probably, 
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Crystallinity and content of nc-Li 4 Ti 5 0 12 in the composite, (a) XRD patterns of the nc-Li 4 Ti 5 0 12 /CNF composite and pristine 
CNF. (b) TG curve of the nc-Li 4 Ti 5 0 12 /CNF composite at 1°C min -1 under air. The residual weight ratio corresponds to 
the content of nc-Li 4 Ti 5 0 12 in the composite 


this is because the nano-crystallized Li 4 Ti 5 0 12 and 
Li 4 Ti 5 0 12 /CNF junctions lead to facile ionic diffu¬ 
sion and electronic conduction, respectively. The 
cycleability of the composite is shown in Fig. 18c. 
Even after 9,000 cycles, 90% of the initial capacity is 
maintained, showing that the composite is electro- 
chemically stable. The result strongly suggests that the 


aggregation and detachment of the nc-Li 4 Ti 5 0 12 parti¬ 
cles hardly happen when they are operated at high-rate 
charge-discharge for a prolonged cycling. 

Figure 19 shows Ragone plots obtained from 
charge-discharge measurements as laminate-type cell 
of the hybrid capacitor system ((nc-Li 4 Ti 5 0 12 /CNF)/ 
LiBF 4 -PC/AC). The charge-discharge was performed 
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Electrochemical Supercapacitors and Hybrid Systems. Figure 16 

Nano-structure of the nc-Li 4 Ti 5 0 12 and CNF. (a) A bird's eye view of a HR-TEM image of the nc-Li 4 Ti 5 0 12 /CNF composite 
and (b) a worm's eye view focusing on the junction of the nc-Li 4 Ti 5 0 12 particle on CNF graphene surface 





Electrochemical Supercapacitors and Hybrid Systems. 
Figure 17 

Closer view of a HR-TEM image of multiple attachments of 
nc-Li 4 Ti 5 0 12 inside and outside wall of the CNF composite 


between 1.5 and 3.0 V at various current densities 
ranging 0.2-30 mA cm -2 (0.18-26.8 A g _1 ). 
For comparison, a conventional EDLC system (AC/ 
TEABF 4 -PC/AC) was also assembled and measured 
between 0 and 2.5 V. In a low power density range of 
0.1-1 kW L _1 , the hybrid capacitor shows the energy 
density as high as 28-30 Wh L _1 , which is a value 
comparable to that of the Li-ion capacitors [31]. Even 
at a high power of 6 kW L _1 , the energy density of 
the hybrid capacitor remains at 15 Wh L -1 which is 
double that of the conventional EDLC system (AC/AC). 
The result reveals that the capacitor system can provide 
higher energy as compared with conventional EDLCs not 
only in the low power density region (0.1-1 kW L -1 ) but 
also in the high power density region (1-6 kW L _1 ). 
Accordingly, this configuration of capacitor system is 
anticipated as an energy device utilizable for both high 
energy and high power applications. 

Future Perspectives 

Hybrid capacitor systems are the promising approach 
to meet the goal to effectively increase the energy 
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Electrochemical Supercapacitors and Hybrid Systems. Figure 18 

Electrochemical properties of the nc-Li 4 Ti 5 0 12 /CNF composite, (a) Charge-discharge curve of nc-Li 4 Ti 5 0 12 /CNF composite 
between 1.0 and 3.0 V versus Li/Li + at 1 C. (b) Rate capability of the nc-Li 4 Ti 5 0 12 /CNF composite in a C-rate ranges from 
1 C to 300 C. (c) Cycleability of nc-Li 4 Ti 5 0 12 /CNF composite between 1.0 and 3.0 V versus Li/Li + at 20 C. 1 M LiBF 4 /EC + 
DEC(1:1) was used as the electrolyte 


density. The investigation to develop hybrid capacitors 
has been initiated by £C Li-ion capacitors (LIC).” And, 
now “Nanohybrid capacitor (NHC)” certainly achieves 
as high energy density as Li-ion capacitors with higher 


stability, higher safety, and higher productivity (see 
Fig. 20). Both LIC and NHC attain three-times higher 
energy density and are regarded as the next-generation 
supercapacitor systems. 
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Electrochemical Supercapacitors and Hybrid Systems. Figure 19 

Ragone plots of Nanohybrid capacitor systems ((nc-Li 4 Ti 50 12 /CNF)/AC) with two different Li 4 Ti 5 0 12 loadings (50% and 
70%) and conventional EDLC system (AC/AC) all in 1 M LiBF 4 /PC 



Electrochemical Supercapacitors and Hybrid Systems. 
Figure 20 

Performances of lithium-ion capacitor (LIC) and 
nanohybrid capacitor (NHC) in Ragone plot 
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Glossary 

Design for environment (DfE) The various design 
approaches geared at reducing the overall environ¬ 
mental impact of a product, process, or service, 
considering environmental impacts across the 
product’s life cycle. 

Ecycling The process of recycling the components in 
used or discarded electronic equipment. 

Electronic waste, e-waste and waste electrical and 
electronic equipment (WEEE) Old, end-of-life, 
or discarded appliances using electricity and includes 
computers, consumer electronics, fridges, etc. which 
have been disposed of by their original users. 

Extended producer responsibility (EPR) Entails 
making manufacturers responsible for the entire 
life cycle of the products and packaging they pro¬ 
duce based on the “polluter pays principle.” 

Persistent and bioaccumulative toxins (PBT) Com¬ 
pounds that are resistant to environmental degra¬ 
dation. Because of this, they have been observed to 
persist in the environment, bioaccumulate in 
human and animal tissue, and to have potential 
significant impacts on human health and the 
environment. 
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Toxicity characteristic leaching procedure (TCLP) 

A soil sample extraction method for chemical anal¬ 
ysis employed as an analytical method to simulate 

leaching through a landfill. 

Definition of the Subject and Its Importance 

Electronic waste or CC E-waste” is broadly defined as all 
surplus electronics. Electronic wastes are computers, 
television sets, mobile phones, etc., destined for reuse, 
resale, salvage, recycling, or disposal. Computers are 
a complex waste problem due to their volume, toxicity, 
and disposable, rather than recyclable, product design. 
Computers quickly become obsolete as technology rap¬ 
idly changes. Everyday more than 160,000 obsolete 
computers and televisions are generated in the USA, 
weighing almost 3.2 thousand metric tons (3.5 thou¬ 
sand US tons). Electronic waste is about 4% of the total 
solid waste stream in North America but is growing two 
to three times faster than any other waste stream (i.e., 
paper, yard waste). For every new computer coming on 
the market there is an obsolete computer [1]. In 1998, 
of the 20 million computers taken out of service in the 
USA, only 2.3 million (slightly more than 10%) were 
recycled or Ecycled, mostly from large businesses and 
institutions. As well, 75% of “obsolete” computers are 
being stockpiled in people’s homes [2]. 

Introduction 

With rapid improvements in technology, most com¬ 
puters are disposed of within 2 years, rather than at the 
end of their functional life cycle of approximately 
10 years [3]. Each new innovation in computer tech¬ 
nology doubles its obsolescence rate, without increas¬ 
ing its repairability or recyclability. The consumer, 
targeted by advertising about the latest computers, 
finds buying a new computer cheaper and more con¬ 
venient than upgrading their old one. Computers that 
cannot be upgraded become waste. 

Electronic waste is a complex waste, made up of 
more than 30 compounds, many of which are hazard¬ 
ous substances. The presence of hazardous substances 
in computers makes landfilling or incinerating envi¬ 
ronmentally unsuitable for discarded computers. The 
use of toxic metals and other chemicals in computers 
results in environmental and health risks when com¬ 
puters are manufactured, incinerated, landfilled, 


burned, or melted down during recycling [4, 5]. 
E-waste emits billions of kilograms of toxic materials 
into the air, water, and soil, most of which are persistent 
and bioaccumulative toxins (PBTs). A cathode ray tube 
(CRT), the most common type of computer monitor 
until recently, typically contains four pounds of lead to 
protect users from the tubes’ X-rays [2]. Lead is 
a teratogen and reproductive hazard. As well, lead 
causes damage to the central and peripheral nervous 
systems, blood system, and kidneys in humans. Circuit 
boards and batteries also contain lead, in addition to 
smaller amounts of mercury and hexavalent chro¬ 
mium, nickel, bromine, and zinc. In 2000, 90% of 
e-waste had no removal of components containing 
toxic mercury switches or nickel batteries before 
landfilling or incineration in Europe and North 
America. With these harmful chemicals, many con¬ 
sumer electronics fail the Toxicity Characteristic 
Leaching Procedure (TCLP) [6]. Leaching of heavy 
metals and brominated compounds occurs during 
landfilling, contaminating the soil and groundwater 
beneath a site and the surrounding area. For example, 
the European Commission estimates that consumer 
electronics constitute 40% of the lead found in landfills. 
Moreover, the workers in e-waste recycling operations 
face dangerous health and environmental risks. Hence 
the concerns about the trade, the technology in prac¬ 
tice, and the existing poor disposal practices of e-waste 
in North America and many nations. 

Increased regulation of electronic waste and con¬ 
cern over the environmental harm, which can result 
from toxic electronic waste has raised disposal costs. 
Regulation creates an economic disincentive to dispose 
of residue, unless strict restrictions enforce proper 
treatment. In extreme cases, brokers and others calling 
themselves recyclers export unscreened electronic 
waste to developing countries, avoiding the expense 
of removing items like bad cathode ray tubes (the 
processing of which is expensive and difficult). 

Hazardous e-waste from developed countries (e.g., 
Western European countries, USA, Canada, and Japan) 
has a history of being recycled in developing countries 
and in Eastern Europe. These products are exported 
when the state has banned various metals in their 
landfills and incinerators, rather than pay the higher 
labor and environmental costs for recycling locally [7]. 
As much as 80% of all recycled e-waste in the USA were 
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shipped to Asia (e.g., China, Cambodia, India, Indo¬ 
nesia, Malaysia, Manila, and Pakistan) all of which have 
poor regulation and management of e-waste [8, 9, 10]. 
This shipping to places with low labor costs and envi¬ 
ronmental regulation occurs for the EU as well but the 
new WEEE-IT (Information Technology) regulation at 
least requires that their e-waste be recycled at facilities 
having as high or higher environmental and safety 
standards as in the EU. 

A few demanufacturing operations exist in North 
America but are typically manned by unskilled or 
semiskilled labor such as prisoners, former welfare 
recipients, or mentally disabled individuals. These 
workers are exposed to hazardous conditions and 
often do not earn a living wage [11]. For example, 
250 inmates, at the Atwater federal US penitentiary’s 
recycling plant process 450 computers daily. The 
inmates receive only US $0.23-$ 1.15 per hour for this 
work and work without safety equipment, such as 
gloves, dust masks to reduce exposures to toxic lead, 
and barriers to prevent physical risks from cuts and 
lead exposure. Hammers are often used to break cath¬ 
ode ray tubes (CRT) rather than a sealed crushing 
machine, which is typical in commercial recycling, 
carrying out this work [12]. 

Regulation of Electronic Waste 

Extended producer responsibility (EPR) regulations in 
Europe hold producers who are legally responsible in 
four ways for their products. They are responsible for 
having: (1) Economic responsibility to pay all or 
a portion of end-of-life management, (2) Physical 
responsibility to take possession of their products 
after the consumer discards it, (3) Information respon¬ 
sibility to provide mandatory product labeling such as 
component or material lists, and (4) Financial liability 
for environmental damage and clean-up costs from 
disposal of hazardous products [6, 13]. Generally, 
EPR policies have three characteristics: (1) a focus on 
end-of-life waste management to encourage redesign 
for environment, (2) a shift of physical and/or financial 
responsibilities from taxpayer or consumer to pro¬ 
ducer, and (3) an explicit target for waste reduction. 
Mandated programs “force producers to get involved 
in managing material streams” [14]. Consumers of 
WEEE products must have the opportunity to return 


waste items, without charge, to collection facilities with 
the producer responsible for product recovery. 

Extended producer responsibility extends the “pol¬ 
luter pays principle” to consumer products, requiring 
“that the polluter should, in principle, bear the cost of 
pollution, with due regard to the public interest and 
without distorting international trade and investments” 
[15]. EPR policy places responsibility for a product’s 
end-of-life environmental impacts on the original pro¬ 
ducer and seller of that product requiring producers: to 
take back their products and to meet recycling rate 
targets. This approach provides incentives for pro¬ 
ducers to optimize their internal logistics and recycling 
systems to minimize costs, create value, and integrate 
eco-design into business routines. Those changes 
should include improving product recyclability and 
reusability, reducing material usage and downsizing 
products, lower energy consumption and greenhouse 
gas production, reduce dependency on virgin materials, 
spur new business enterprises, generate new job oppor¬ 
tunities, and provide financial and savings to companies 
improving their design, production, and distribution 
processes [ 16] . However, the ability to gain competitive 
advantage is restrained by the domination of collective 
compliance structures across the EU, over whose costs 
producers regard as difficult to manage or influence. 

Despite new EU shipment rules banning the export 
of hazardous materials in waste electrical equipment to 
non-OECD countries from January 2002, a few 
exporters have transferred over two million old com¬ 
puter monitors and one million old TVs out of the EU 
for reuse. For example, of an estimated 121,000 metric 
tons (133,000 US tons) of computer and telecommu¬ 
nications equipment exported from the UK for re-use 
or recycling in 2003, 21,000 metric tons (23,000 US 
tons) was “grey market” equipment being exported 
without the full necessary documentation. This equip¬ 
ment is rarely tested or repaired before export to coun¬ 
tries where dismantling takes place in unsafe 
conditions, or the equipment is simply dumped. With¬ 
out criteria for the reuse of IT appliances (e.g., testing, 
minimal capacity, etc.) this environmental injustice can 
continue. Very little consumer-based electronic waste 
has true reuse value, and claims of export for reuse 
should be scrutinized carefully. Although Canada pro¬ 
hibits waste export to non-OECD nations, this occurs 
disguised as computer donations for reuse. 


E 




3446 


E 


Electronic Waste and Its Regulation 


According to WEEE IT EU regulations, an average 
of 75% recovery by weight and of 65% component, 
material, and substance reuse and recycling must be 
achieved by December 31, 2006 [17]. These targets 
appear feasible, even low, based on pre-WEEE-IT pro¬ 
grams surpassing these targets. Prior to WEEE, several 
member states within the EU (Belgium, the Nether¬ 
lands, Sweden, Norway, and Switzerland) adopted 
national regulations and management schemes for 
e-wastes [6], with programs operating at a recycling 
rate (including energy recovery) of approximately 
80-97% with 97% achieved by Switzerland [18]. 
Although high targets are being set and a good 
recycling/reuse rate is achieved, still a significant 
amount of waste (10-20%) is not recovered for 
recycling and reuse, presumably ending up in landfills 
or incinerators, releasing toxic chemicals. That some 
91,000 metric tons (100,000 US tons) are incinerated or 
landfilled each year means computer waste has not 
been prevented and mass consumption of this disposal 
product continues to occur [19]. 

The USA and Canada have, rarely, if ever, mandated 
strict guidelines for product manufacturing, which 
includes both how the product is produced and what 
types of materials are used - perhaps as a result of the 
lack of political pressures to develop new waste man¬ 
agement strategies amidst relatively plentiful landfill 
accommodations. Policy makers in North America 
are reluctant to require sustainable product manage¬ 
ment. A few companies, such as HP and Dell, do 
voluntarily take back their products, but at high costs 
that many consumers are unwilling to pay or have 
severe restrictions that limit acceptance to consump¬ 
tion of their new materials. The recycling rates of vol¬ 
untary computer take-back programs in the USA and 
Canada conducted by a few companies, such as Dell, 
HP, Sony amount to only 14%, according to the most 
recent available figures, due to cost and acceptance of 
e-waste in landfills [2]. Many economically successful 
reuse and recycling programs are in effect [15, 20, 21, 
22] , even with producers’ focus on productivity and the 
bottom-line. Hewlett-Packard (HP) was the first com¬ 
puter corporation to have a “tear it down” operation to 
accompany its successful “build it up” operation called 
Product Recycling Solutions (PRS), launching a take- 
back program in 1992 [15] in which HP committed to 
reuse or donate or environmentally sound recycle their 


products [7], HP limits their financing to the pickup 
and transport of their printers and cartridges, requiring 
customers to pay $25-$50 Canadian for each e-waste 
component (CPU, monitor, scanner, server). HP 
witnessed numerous economic and market-share ben¬ 
efits from reducing resource and energy consumption 
of their products and production processes. An 87% 
landfill diversion rate reduced landfill costs, and 
reusing items accounted for $2.9 million Canadian in 
benefits for HP in addition to $2 million revenue for 
selling recycled materials in 2006 [23]. By applying 
reuse at HP, 15% of defects were reduced and produc¬ 
tivity increased by 57% [22]. However, it should be 
noted that although the company avoided 16,000 met¬ 
ric tons (18,000 US tons) of carbon dioxide (C0 2 ) in 
2006 through energy efficiency measures their emis¬ 
sions in 2006 were approximately lOOx that at roughly 
1,600,000 metric tons (1,800,000 US tons) of carbon 
dioxide [23]. The Electronic Product Stewardship 
Canada (2006) reports 91,000 metric tons (100,000 
US tons) of electronics reach end of life in Canada 
annually and only 11% of this is currently recycled. 

Future Directions 

Policies can encourage redesign for reuse and 
upgrading for longevity, as well as recycling: 
Computers can be redesigned to have less material 
(de-materialization), facilitate upgrading through 
modularity, last longer, expend less energy and pollu¬ 
tion in their production and use and that disassemble 
for reuse and recycling with ease after their useful life. 
Design for the Environment (DfE) promotes consider¬ 
ation of design issues related to environmental and 
human health over the life cycle of the product. Mate¬ 
rial selection, energy use, extended component life 
cycles, disassembly, reprocessing, remanufacturing 
must all be considered [24]. DfE has eight axioms: 

(1) manufacture without producing hazardous waste, 

(2) use clean technologies, (3) reduce product chemical 
emissions, (4) reduce product energy consumption, 
(5) use non-hazardous recyclable materials, (6) use 
recycled material and reused components, (7) design 
for ease of disassembly, and (8) product reuse or 
recycling at end of life [25]. Principles of ecodesign 
require products to be flexible, reliable, durable, adapt¬ 
able, modular, dematerialized, and reusable. As applies 
to computers, improving environmental factors can 
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improve performance as well: work on optimization of 
environmental performance often spurs creativity and 
brings innovative design solutions into the product 
development process which allows improvement of its 
overall performance [26] . 

Policies can encourage reuse, as well as, recycling. 
Recycling is the series of activities by which discarded 
materials are collected, sorted, processed, and used in 
the production [27]. Although recycling reduces virgin 
material use, it still requires additional energy to be 
used to reform them into manufactured products [27]. 
Silicon Valley Toxics Coalition (2001) estimates that 
the cost of recycling computers ranges from $10 to 
$30 per unit. While this is less expensive than the 
estimated $25-50 per unit cost for disposal, the value 
of recycled materials in computers is $2.00-$3.00. 

In contrast, remanufacturing, reconditioning, and 
repair should save money for the producer and reduce 
costs for the consumer. Over each computer’s life cycle, 
reselling or upgrading computers saves 5-20 times 
more energy than recycling and would provide 
a public good if provided to organizations for educa¬ 
tional purposes. For example, public schools have large 
needs for computers at 28.7 million computers in USA, 
which may be met by reusing or remanufactured com¬ 
puters, as recent state and local budget cuts otherwise 
severely limit the affordability of having computers in 
the classroom [28]. Reuse can include repair, recondi¬ 
tion, or remanufacture. Also, it is environmentally 
preferable to reuse computers an extra 2 or 3 years 
than to recycle their components. Reducing the 
amount of computer waste relies heavily upon the 
reuse of systems that may be out of date, but fully 
functional [3]. Remanufacturing is considered the 
best strategy as the embodied energy of virgin produc¬ 
tion is maintained, preserving the intrinsic “added 
value” of the product for the manufacturer and enables 
the resultant products to be sold “as new” with updated 
features if necessary [27]. Components and materials 
from the end-of-life durable products can often be 
refurbished to substitute for virgin parts to be used as 
spares or in remanufacturing [29]. Upgrading individ¬ 
ual components of a computer at the design stage, such 
as the hard drive, software, or processor, costs less than 
a new computer [3]. Lund, R. (1985) estimates that 
a remanufacturing product only requires 20-25% of 
the energy used in its initial formation and can be 


resold into lower-priced markets, typically costing 
60% of the original production cost [27, 30]. However, 
the lack of a credible and stable demand for recycled or 
refurbished products is a barrier to remanufacturing 
[27, 31]. The noncommercial computer refurbishing 
field is a minuscule part of the overall computer 
recycling industry in the USA and Canada. 

Designing for the environment (DfE) and 
implementing green production processes makes eco¬ 
nomic sense when life cycle environmental costs are 
required to be paid by the producer. DfE has producers 
considering “at the development phase of a products 
life cycle, the environmental impacts through enhanc¬ 
ing the product design which includes resource con¬ 
sumption, both in material and energy terms and 
pollution prevention.” Environmental considerations 
in product design include: waste minimization, reuse 
or recyclability, material conservation, pollution reduc¬ 
tion, lower toxicity and “eco-design” [15, 32]. Six prin¬ 
ciples of Design for the Environment (DfE) are efficiency, 
appropriateness, sufficiency, equity, systems, and scale 
[33]. However, without extended producer responsi¬ 
bility internalizing environmental costs, manufacturers 
are typically unwilling to improve product and envi¬ 
ronmental quality if it costs more or compromises 
costs, quality, and scheduling constraints [24]. Firms 
are only willing to spend 0.1% more on a product to 
improve environmental quality [34]. Thus, usually 
design takes on the character of being a problem¬ 
solving activity for a component rather than product 
development from its conceptual stage [35, 36]. 

Bibliography 

Primary Literature 

1 . Thompson S, Oh S (2006) Do sustainable computers result 
from design for environment and extended producer respon¬ 
sibility?: analyzing e-waste programs in Europe and Canada. 
ISWA/NVRD world congress 2006 conference proceedings. 
ISWA/NRVD, Copenhagen, Denmark, pp 145-161 

2. Gattuso DJ (2005) Mandated recycling of computers: a lose- 
lose-lose proposition. Competitive Enterprise Institute (CEI) 

3. Williams E, Sasaki Y (2003) Energy analysis of end-of-life 
options for computers: resell, upgrade, recycle. In: Proceed¬ 
ings of the 2003 IEEE international symposium on electronics 
and the environment, San Francisco, CA, pp187-192 

4. Lincoln JD et al. (2005) Environmentally benign materials for 
electronics: A review current development and merging 


E 




3448 


E 


Electronic Waste and Its Regulation 


technologies. In Proceedings. International symposium on 
advanced packaging materials: processes, properties and 
interfaces. IEEE, Piscataway, NJ 

5. Schmidt CW (2002) E-Junk explosion. Environ Health Perspect 
110(4):232-235 

6 . Sachs N (2006) Planning the funeral at the birth: extended 
producer responsibility in the European Union and the United 
States. Harv Environ Law Rev 30:51-98 

7. McCarthy JE (2002) Recycling computer and electronic equip¬ 
ment: legislative and regulatory approaches for "E-Waste," 
Congressional Research Service, pp 1-21 

8 . lies A (2004) Mapping environment justice in technology 
flows: computer waste impacts in Asia. Glob Environ Polit 
4(4): 76-105 

9. Terazono A et al (2006) Current status and research on E-waste 
issues in Asia. J Mater Cycle Waste Manage 8:1 -12 

10. Yanez L et al (2002) Overview of human health and chemical 
mixtures: problems facing developing countries. Environ 
Health Perspect 10(110):901 —909 

11. Price JL (1999) Reclaiming end-of-life cathode ray tubes 
(CRTs), and electronics: a Florida update, hazardous material 
Management conference, Tucson, Arizona. 15 Nov 1999 

12. Perry TS (2005) Recycling behind bars. IEEE Spectr 42(6):10-13 

13. Toffel MW (2002) The regulatory and judicial roots of product 
stewardship in the United States. Haas School of Business/ 
University of California, Berkeley 

14. McKerlie K, Knight K, Thorpe B (2006) Advancing extended 
producer responsibility in Canada. J Cleaner Prod 14:616-628 

15. Schwartz BJ, Gattuso DJ (2002) Extended producer responsi¬ 
bility: reexamining its role in environmental progress. Reason 
Public Policy Institute, Los Angeles, pp 1-65 

16. Quinn L (2006) Extended producer responsibility: new direc¬ 
tion for waste management. Eco-Journal (March/April) 

17. Nakajima N, Vanderburg WH (2005) A failing grade for WEEE 
take-back programs for information technology equipment. 
Bull Sci Technol Soc 25(6):507-517 

18. Vossennaar R, Santucci L, Ramungul N (2006) Environmental 
requirements and market access for development countries: 
the case of electrical and electronic equipment. Trade and 
environment review. United Nations, New York 

19. Lin CK, Yan L, Davis AN (2002) Globalization, extended pro¬ 
ducer responsibility and the problem of discarded computers 
in China: an exploratory proposal for environmental protec¬ 
tion. Georgetown Int Environ Law Rev 14:525 

20. Bosch J (2002) Maturity and evolution in software product 
lines: approaches, artifacts and organization. In: Proceeding 
of the second software product line conference 

21. Fiksel J (2001) Emergence of a sustainable business commu¬ 
nity. Pure Appl Chem 73:1265-1268 

22. Tomer A, Kimchi E (2004) Evaluating software reuse alterna¬ 
tives: a model and its application to an industrial case study. 
IEEE Trans Software Eng 30(9):601 —612 

23. Hewett Packard (2007) HP global citizenship report: waste 
and recycling, http://www.hp.com/hpinfo/globalcitizenship/ 
gcreport/operations/waste.html. Accessed 27 Mar 2007 


24. Armstrong T (1997) Design for sustainability, sustainable 
consumption and production. IndEco Strategic Consulting, 
Ottawa 

25. Hill B (1993) Industry's integration of environmental product 
design. In: 1993 IEEE international symposium on electronics 
and the environment. IEEE, Virginia 

26. Thompson S, Sawyer J, Bonam R, Valdivia JE (2009) Building 
a better waste generation model: validating models with recov¬ 
ery rates at 35 Canadian landfills. Waste Manage 29:2085-2091 

27. King AM, Burgess SC, Ijomah W, McMahon CA (2005) Reducing 
waste: repair, recondition, remanufacture or recycle?. Sust Dev 

28. Lynch J (2004) Baseline study of noncommercial computer 
reuse in the United States. In: International symposium on 
electronics and the environment (IEEE). CompuMentor, 
San Francisco, pp 189-194 

29. Toffel MW (2004) Strategic management of product recovery. 
Calif Manage Rev 46(2):120-141 

30. Lund R (1985) Remanufacturing: the experience of the United 
States and implications for developing countries, UNDP pro¬ 
ject management report 2, World Bank Technical Paper No 31, 
pp 24-34. In: King AM, Burgess SC, Ijomah W2McMahon CA 
(2005) Reducing waste: repair, recondition, remanufacture or 
recycle? J Sustain Develop 

31. Bras B, McIntosh MW (1999) Product, process and organiza¬ 
tional design for remanufacture - an overview of research. 
Robot Comput-lntegr Manuf 15:167-178 

32. Calcott P, Walls M (2006) Can downstream waste disposal 
policies encourage upstream "design for environment"?. The 
American Economic Review: Papers and Proceeding of the 
One Hundred Twelfth Annual Meeting of the American 
Economic Association 90 (2), 233-237 

33. Flecher K, Dewberry E (2002) Demi: a case study in design for 
sustainability. Int J Sustain High Educ 3(1 ):38—47 

34. Matthews HS, McMichael FC, Hendrickson CT, Hart DJ 
(1997) Disposition and end-of-life options for personal com¬ 
puters. Green design initiative technical report #97-10 

35. Fallman D (2003) Design-oriented human-computer interac¬ 
tion. Designing Des 5(1 ):225—232 

36. Nichol S, Thompson S (2007) Policy options to reduce con¬ 
sumer waste to zero: comparing product stewardship and 
extended producer responsibility for refrigerator waste. 
Waste Manage Res 25(3):227-233 

Books and Reviews 

Boehm B (2003) Value-based software engineering. ACM SIGSOFT 
Softw EngNotes28(2):1-11 

Bradner S, Mankin A (1995) The recommendation for the IP next 
generation protocol, Network Working Group 

Bras B (1997) Incorporating environmental issues in product 
design and realization. Ind Environ 20(1 —2):1 —19 

Business and Industry Advisory Committee (1998) Business and 
Industry Advisory Committee (BIAC) statement to the OECD 
EPR workshop 3 on extended and shared responsibility for 
products Washington, DC, 1-3 Dec 1998 



Energy and Water Interdependence, and Their Implications for Urban Areas 


E 


3449 


Cull K, Malins J (2003) Designing a sustainable future: a new 
approach to influence design practice, http://www.ub.es/ 
5ead/PDF/5%20/CullMalins.pdf 

Farrell A (1996) Sustainability and the design of knowledge tools. 
Technol Soc Mag 15:17-21 

Hauschild M, Wenzel H # Alting L (1999) Life cycle design: enroute to 
the sustainable industrial culture? Ann CIRP 48(1):393-396 

Jacard M (2006) Mobilizing Producers toward Environmental Sus¬ 
tainability: The Prospects for Market-Oriented Regulations. In: 
Glen Toner (ed) Sustainable production: building Canadian 
capacity. UBC Press, Toronto 

Lilly D, Lofthouse V Bhamra T (2005) Towards instinctive sustain¬ 
able product use. In: Sustainable creating the culture, 
Aberdeen, 2-4th Nov 

Macauley M, Palmer K, Shih J (2003) Dealing with electronic waste: 
modeling the costs and environmental benefits of computer 
monitor disposal. J Environ Manage 68:13-22 

McDonough W, Braungart M (2002) Design for the triple top line: 
new tools for sustainable commerce. Corp Environ Strategy 
9(3):251 —258 

Merola L (2006) The COTS software obsolescence threat. In: Fifth 
international conference on commercial-off-the-shelf (COTS)- 
based software systems, pp 127-133 

Moffet J, Meyer S, Pezzack J (2006) Collaborative public policy for 
sustainable production: a broad agenda and a modest 
proposal. In: Toner G (ed) Sustainable production: building 
Canadian capacity. UBC Press, Toronto 

National Safety Council (1999) Electronic product recovery and 
recycling baseline report: recycling of selected electronic 
products in the United States. National Safety Council, 
Washington, DC, p 29 

Perdan S, Azapagic A, Clift R (2000) Teaching sustainable develop¬ 
ment to engineering students. Int J Sustain High Educ 
1 (3):267-279 

Sheehan B, Spiegelman H (2005) Extended producer responsibility 
policies in the United States and Canada. In: Scheer D, Rubik F 
(eds) Governance of integrated product policy. Greenleaf Pub¬ 
lishing, Sheffield, pp 202-223 

Sheehan B, Spiegelman FI (2005b) EPR in the US and Canada. 
Resour Recycling 2:18-21 

Sheehan B, Spiegelman H (2006) Extended producer responsibility 
policies in the United States and Canada: history and status. 
Greenleaf Publishing, Sheffield. Silicon Valley Toxics Coalition, 
Poison PCs and Toxic TVs 

Swanson M (2003) Canadian producer responsibility organizations. 
E4 Partners, pp 2-10. 15 Mar 2006 

Walls M (2006) Extended producer responsibility and product 
design. Resource for the future 

Wheeler D, Thomson K, Perkin M (2006) Sustainability, social cap¬ 
ital and the Canadian ICT Secto. In: Glen Toner (ed) Sustainable 
production: building Canadian capacity. UBC Press, Toronto 

Xu CY, Morrison PD (2005) Sustainable product development and 
business performance: a product life cycle approach. ANZMAC 
conference: entrepreneurship, innovation and new product 
development 


Energy and Water Interdependence, 
and Their Implications for Urban 
Areas 

Liz Minne 1 , Arka Pandit 2 , John C. Crittenden 3 , 
Miroslav M. Begovic 4 , Insu Kim 4 , Hyunju Jeong 2 , 
Jean-Ann James 2 , Zhongming Lu 2 , Ming Xu 5 , 

Steve French 6 , Muthukumar Subrahmanyam 6 , 

Douglas Noonan 7 , Marilyn A. Brown 7 , Jess Chandler 7 , 
Yongsheng Chen 8 , Eric Williams 9 , Reginald 
Desroches 8 , Bert Bras 10 , Ke Li 11 , Michael Chang 2 
Environmental Engineering, Brook Byers Institute for 
Sustainable Systems, Georgia Institute of Technology, 
Atlanta, GA, USA 

2 Brook Byers Institute for Sustainable Systems and the 
Department of Civil 8c Environmental Engineering, 
Georgia Institute of Technology, Atlanta, GA, USA 
3 Georgia Institute of Technology, Atlanta, GA, USA 
4 School of Electrical and Computer Engineering, 
Georgia Institute of Technology, Atlanta, GA, USA 
5 School of Natural Resources and Environment, 
University of Michigan, Ann Arbor, USA 
6 Center for Geographic Information Systems and the 
School of City and Regional Planning, Georgia 
Institute of Technology, Atlanta, GA, USA 
7 School of Public Policy, Georgia Institute of 
Technology, Atlanta, GA, USA 
department of Civil 8c Environmental Engineering, 
Georgia Institute of Technology, Atlanta, GA, USA 
9 Golisano Institute of Sustainability, Rochester 
Institute of Technology, Rochester, NY, USA 
10 George W. Woodruff School of Mechanical 
Engineering, Georgia Institute of Technology, Atlanta, 
GA, USA 

1 department of Biological and Agricultural 
Engineering, University of Georgia, Atlanta, GA, USA 


Article Outline 

Glossary 

Definition of Subject 
Introduction 
Methodologies 
Current Infrastructure 
Water and Energy Nexus 


E 





3450 


E 


Energy and Water Interdependence, and Their Implications for Urban Areas 


Beyond the Water and Energy Nexus 

Steps Toward a More Sustainable Future 

Future Directions 

Acknowledgments 

Bibliography 

Glossary 

Anthroposphere That part of the environment that is 
developed or adapted by humans. 

Life cycle assessment (LCA) Analysis of the impact of 
a product or service through all the stages of its life 
including raw resource acquisition, material 
processing, fabrication, assembly, transportation, 
operation, disposal, recycling, repurposing, or 
reusing. 

Low-impact development (LID) A way of managing 
stormwater runoff through decentralized water sys¬ 
tems, such as green roofs and rainwater harvesting. 

Material flow analysis (MFA) Analysis of how 
resources move through the industrial, consumer, 
and ecological sectors. 

Resilience The ability for a structure to maintain its 
fitness and function in response to indigenous and 
exogenous stressors; it is characterized by four 
measures: 

Rapidity. A measure of the capacity to contain 
losses or prevent further degradation in a timely 
manner. 

Redundancy. A measure of the inherent 
substitutability. 

Resourcefulness. A measure of the capacity to 
mobilize resources to restore functionality in the 
event of disruption. 

Robustness. An ability of the system to with¬ 
stand a given level of stress and/or demand. 

Sustainability The development of an anthrosphere 
that exists within the means of nature. Sustainabil¬ 
ity requires that humans use no more resources 
than nature can renewably provide and generate 
only wastes that nature can assimilate in a timely 
manner and without overwhelming natural cycles. 

Urban heat-island effect The phenomenon where 
the temperature in a developed area with a high 
concentration of buildings, roads, and other phys¬ 
ical infrastructure is significantly higher than in the 
surrounding undeveloped areas. 


Urban sustainability An assessment of the long-term 
viability of a place considering the physical infra¬ 
structure (such as water, sewer, transportation, and 
buildings), socioeconomic issues (such as human 
well-being, health, wealth, safety, and security), and 
natural environment (such as resources, wildlife, 
and ecosystem health). 

Definition of the Subject 

There are many definitions for sustainability. Mathis 
Wackernagel, creator of the ecological footprint con¬ 
cept, defined sustainability as “securing people’s qual¬ 
ity of life within the means of nature” [ 1 ]. The United 
Nations’ World Commission on Environment and 
Development (the Brundtland Commission) defined 
sustainable development as “development that meets 
the needs of the present without compromising the 
ability of future generations to meet their own needs” 
[2]. Throughout this report, sustainability will be 
defined as the development of the anthroposphere 
within the means of nature. Here, the anthroposphere 
is the built environment or the environment that 
humans created for security, wealth generation, and 
comfort. For it to exist within the means of nature 
implies that the anthroposphere must use resources 
that nature provides, and generate only the kind and 
amount of waste that can be assimilated into the envi¬ 
ronment without overwhelming natural cycles. At the 
global scale, there are several examples that suggest that 
current development patterns are unsustainable. Pres¬ 
ently worldwide, there are approximately 7 billion peo¬ 
ple using 14 Gt of materials [3]. With only 5% of the 
global material use being renewable, the extraction of 
natural resources is beyond what nature can reasonably 
supply. At the other end, an enormous amount of 
synthetic and potentially toxic materials are being 
introduced into the global material cycle. 

To fully understand sustainability, one needs to take 
a whole system approach. In the traditional reductive 
engineering paradigm, each of the individual infra¬ 
structure components such as water, energy, and trans¬ 
portation are optimized separately. However, since the 
function of all the component elements depend on each 
other, a more optimal solution can occur if all the 
urban pieces are considered together. These interde¬ 
pendencies form an ecosystem of infrastructure that, 
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if functioning properly, provides a lasting basis for 
human enterprise. An example of the interdepen¬ 
dencies and the need for them to be considered 
together is the water energy nexus. In all but the most 
primitive cultures, it takes water to create useful energy 
and energy to create useful water. This nexus requires 
a comprehensive understanding not just of power gen¬ 
eration or of water resource management, but of both 
and how they connect and interact at temporal and 
spatial scales that are both large and small. And on top 
of these engineering relationships, environmental, 
social, and economical concerns must also be 
integrated. 

Why is sustainability, particularly urban sustain¬ 
ability, so important? The United Nations Environ¬ 
mental Program (UNEP) chief, Klaus Toepfer, stated 
in 2005 that “cities pull in huge amounts of resources 
including water, food, timber, metals and people. They 
export large amounts of wastes including household 
and industrial wastes, wastewater and the gases linked 
with global warming. Thus their impacts stretch 
beyond their physical borders, affecting countries, 
regions and the planet as a whole. So the battle for 
sustainable development, for delivering a more envi¬ 
ronmentally stable, just, and healthier world, is going 
to be largely won and lost in our cities” [4] . The UNEP 
expects 64% of the world’s population to live in urban 
areas by 2040, a dramatic increase from 50.5% in 2010 
[5]. For the developed countries like the United States, 
those in the European Union, and others, the share of 
urban population is projected to be 87% in the same 
time period [5]. With such an influx of people from 
rural areas to urban areas, cities must be prepared for 
this mass in-migration. With much of this movement 
expected in developing countries, those that adopt 
sustainability as a guiding framework for development 
could leapfrog other countries that are financially and 
systemically burdened with maintaining the status quo. 
For the developed world, a hybrid approach in which 
existing centralized systems are augmented by 
decentralized systems could help those cities remain 
competitive and viable. New or old, however, 
each city has different demographics, cultural values, 
fiscal and physical constraints, climate, and topology, 
and any solution to the demand for urban 
infrastructure - sustainable or not - will have to be 
uniquely tailored to consider these differences. 


Introduction 

Transmission grids are an integral piece of urban infra¬ 
structure. Understanding how electricity is used and 
finding ways to decrease the use of energy will be 
essential for developing more sustainable urban sys¬ 
tems. Likely the greatest opportunities for improving 
efficiencies and increasing sustainability lies not in 
a single solution, but in how all the pieces - old tech¬ 
nology and new - interconnect and work together. 
Transmission grids not only transmit energy to 
residencies and workplaces, but also to water treatment 
systems, to industries, to street lights, to trains, and 
to the gasoline stations where cars are fueled, just to 
name a few. These dependencies are not unidirectional, 
however. Energy production and demand are also 
significantly dependent on these systems. Pipelines 
move natural gas, oil, and water to refineries and 
power plants, while trains and ships transport coal for 
use in heating, manufacturing, and power production. 
Adding to this is the emergence of affordable and 
technologically feasible renewable or decentralized 
energy options such as photovoltaics, wind power, 
and microturbines. The most productive, healthy, and 
sustainable cities of the future will be the ones that 
learn to use all these resources to build and operate 
wholly integrated energy systems that meet the short- 
and long-term economic, social, and environmental 
needs and aspirations of their residents and region. 

Urban areas are complex interconnected systems 
analogous to ecological food webs. For an urban area 
to function properly, water, energy, and transportation 
systems need to be effectively designed to support the 
area’s desired land use. Similarly, the land-use pattern 
also should be governed according to the capacity of 
the area to host development. Beginning in the nine¬ 
teenth century, these systems have increasingly been 
designed, built, and operated as isolated systems with¬ 
out considering how they interact. Water purveyors 
have been concerned about water distribution. Power 
companies examined energy production, distribution, 
and usage. Transportation planners examined ways to 
move people and goods safely and efficiently. Generally, 
they cared about optimizing their own processes to 
satisfy their own needs. However, in recent years, 
some cities have begun to realize that they could 
improve efficiency by considering how these different 
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systems interact and function together within the 
whole urban system, rather than as individual and 
independent parts. For example, energy generation 
requires water for steam production and cooling. The 
waste heat from energy production could be used 
to produce hot water, heat homes, or provide 
air-conditioning. In another example, a Pacific North¬ 
west National Laboratory study in 2007 suggested that 
73% of the United States’ current light-duty vehicle 
fleet could be supported by the existing electric power 
infrastructure [6]. It is less clear, however, how the 
increased electricity demand required to electrify trans¬ 
portation would affect water demand. These are just 
two simple examples. Socioeconomics, policy, land use, 
and urban infrastructure form a complex system that 
makes up the city. When a holistic approach is adopted 
to analyze the urban infrastructure, many properties 
emerge that are not apparent when the components 
are analyzed individually. These emergent properties 
evolve from the economic and sociopolitical frame¬ 
work adopted by the stakeholders for the urban area. 
Hence, citizen capacity building, whereby informed 
citizens demand more sustainable infrastructure, is 
vital to driving the political agenda to improve the 
urban infrastructure. 

A city is such a multifaceted and complex system 
that one cannot easily test the effect of certain decisions 
or actions. However, the impact of decisions can be 
estimated through modeling that simulates how agents 
change their behavior in reaction to different options. 
For example, a first-generation agent-based modeling 
analysis of urban infrastructure would include the 
following: 

(a) A determination of the spatial and temporal 
demand for urban infrastructure due to develop¬ 
ment/redevelopment activities 

(b) A listing of infrastructure alternatives that satisfy 
the needs of the development 

(c) A calculation of the material and energy required 
for the different alternatives 

(d) A determination of the vulnerability and resiliency 
of the urban infrastructure alternatives against 
exogenous or endogenous risks 

(e) An assessment of the local, regional, and global 
impacts of the alternatives through life cycle 
analysis 


A second-generation model would include feed¬ 
back loops in which all previous decisions are 
reconsidered in light of outcomes revealed by the 
model. This is more realistic because of future deci¬ 
sions do depend on evolving properties. Once the 
model is verified, the dynamic policies and economic 
drivers (i.e., market forces) that will yield more sus¬ 
tainable and resilient urban infrastructure can be 
determined more accurately. More importantly, the 
model can be used to assess the sensitivity of outcomes 
to different inputs and resources can be focused on 
those components that will result in the greatest 
improvement. 

Methodologies 

There are many methods for evaluating sustainability. 
As with all methods, each has strengths and limitations. 
Some of the methods used to evaluate the sustainability 
of urban environments are discussed below. 

Resiliency 

Resiliency is the ability of a structure to maintain its 
functionality despite exogenous or endogenous 
stressors. Resilient systems are flexible and adaptable 
and typically provide a better return on material invest¬ 
ments in the long term. There are four important attri¬ 
butes that compose resiliency: robustness, redundancy, 
resourcefulness, and rapidity. Robustness is the ability of 
the system to withstand a given level of stress or demand. 
Redundancy is a measure of the inherent substitutability, 
or the ease with which a component or a whole system, 
or the function that they provide, could be replaced in 
case of a failure. Resourcefulness is a measure of the 
capacity to mobilize resources in the event of disruption. 
Rapidity is a measure of the capacity to contain losses or 
prevent further degradation in a timely manner. These 4 
“R’s” can be used as metrics in determining the sustain¬ 
ability of a system or infrastructure [7]. Urban infra¬ 
structure that is designed to minimize material and 
energy investments may not in fact be sustainable in 
the long term if the reductions result in low resiliency 
and the functioning of the infrastructure is jeopardized. 
Resiliency serves as a check against short-sighted, but 
well intentioned, actions that do not contribute to 
longer-term sustainability. 
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Life Cycle Assessment 

Life cycle assessment (LCA) is the analysis of the impact 
of one particular product across all phases of the prod¬ 
uct’s “life.” Life can be defined in multiple ways. The 
most common way an LCA is performed is from cradle- 
to-grave. This considers raw material extraction, 
manufacturing, use, and disposal. While a great deal 
of attention is often paid to a product’s manufacturing 
phase, the use phase is often the most important phase 
based on a product’s impact. For example, a car uses 
a great deal more energy and resources in the use phase 
than in the manufacturing phase. 

If a product is recycled or reused instead of being 
disposed of, it is cradle-to-cradle, and the life of the 
product is extended. For example, plastic soda bottles 
made of polyethylene terephthalate (PET, recycling 
symbol number 1) can be recycled to make new plastic 
bottles or a variety of other products, like T-shirts and 
carpeting. Cradle-to-cradle aims to reduce waste and 
decrease the amount of virgin resources that are used in 
a product. 

Other methods may shorten the view of an LCA, 
such as cradle-to-gate and gate-to-gate. Cradle-to-gate 
looks at the product from raw resources all the way 
through manufacture, but ends there. This might be 
useful particularly to a manufacturing plant or for 
products that have negligible effect in their use and 
disposal phase. Manufacturing also might look at 
a gate-to-gate LCA. This kind of LCA looks only at 
the production chain. This could be useful in compar¬ 
ing the efficiencies of different manufacturing methods 
and tools. 

Two additional LCA methods that may be 
employed are the Economic Input-Output LCA (EIO- 
LCA) and the Ecologically based LCA (Eco-LCA). The 
EIO-LCA was developed at Carnegie Mellon, and is 
used to measure the economic effects of the materials 
and energy used to make a product [8]. Ohio State 
University developed the Eco-LCA [9]. This tool adds 
to the traditional cradle-to-grave LCA by emphasizing 
more analysis on the direct and indirect effects of 
a product on pollutant emissions and economic flows. 

Life cycle assessment is a useful tool when looking 
at sustainability; however, it does have a number of 
limitations. Not all factors can be tracked or easily 
given a numerical value. For example, it is difficult to 


put a numerical value on the health effects of a product 
because toxicity and the transport and exposure routes 
may not be that easy to quantify. 

One other limitation of LCA is the cutoff problem. 
For any product or process there are a large number of 
product stages or processes upstream and it is never 
possible to include all of them. Consider, for example, 
automobile manufacturing. An automobile needs parts 
for its manufacturing. The parts need a suite of metals. 
Mining equipment is required to obtain the required 
metals. But the metal needed for the mining equipment 
also required machines to obtain it. And so the chain 
continues. In LCA, the chain needs to be truncated 
somewhere resulting in errors of omission. Additionally, 
the different LCA methods and commercially available 
software, like SimaPro, that are widely used to conduct 
LCA may use different assumptions and data resulting in 
a variation of results that must be reconciled. 

Material Flow Analysis 

A material flow analysis (MFA) is a process used to 
assess the flow of mass and energy within a defined 
system. It can be conducted at national scales, at the 
scale of a product, or at any scale in between. Urban 
metabolism is a specific MFA tool looking at a defined 
urban area. This method tends to look at the city as 
a “black box.” It considers the flows moving in or 
coming out of the city, but does not necessarily look 
at the mechanisms inside the city that cause these fluxes 
to occur. It does give a view of the amount of goods and 
energy needed to support a city, and how much waste 
the city produces. However, one constraint is the need 
for extensive input and output information for a huge 
number of materials. Some of these numbers are 
tracked at the national level, or are easy to track within 
one industry, but may be difficult to find or track for 
less controlled regions like states or cities. This 
approach works well for islands because all inputs are 
brought in by boats and recorded, and outputs also 
must go out on boat or stay on the island. With better 
data collection, this method will become increasingly 
useful. An improvement on urban metabolism would 
be to track the inflows of resources, the outflows of 
wastes, and the creation of infrastructure that creates 
wealth and comfort. The guiding principles would be 
to use fewer resources to create and operate the 


E 




3454 


E 


Energy and Water Interdependence, and Their Implications for Urban Areas 


infrastructure needed per capita. In practice, it can be 
difficult to compare one city to another because cities 
can be in different states of build-out of its infrastruc¬ 
ture (e.g., a greenfield city as compared to a more 
mature city) and its climate, demographics, and topol¬ 
ogy can impose significant confounding effects. 

Industrial Ecology 

Industrial ecology can be defined as a multidisciplinary 
system-level study focusing on the interconnections 
between the industry, economics, and the natural envi¬ 
ronment. Industrial ecology utilizes a suite of different 
tools to analyze these interdependencies. Among these 
tools are life cycle assessment and material flow analy¬ 
sis, along with designing and manufacturing for the 
environment and ecological efficiency within the 
industry. The waste or by-products of one industry 
might be a resource for another industry, and if the 
two industries are located close enough for logistical 
viability, then the waste/by-product can be utilized by 
the other industry. For example, a cement manufac¬ 
turer located near a power production facility may be 
able to use residual fly-ash from the power plant for its 
cement mix. And if a drywall production facility is also 
in the vicinity, it might be able to use the CaS0 4 from 
the cement plant as its raw material. Thus the net waste 
generation from these industries is reduced and their 
proximity saves significant fuel from the reduced need 
for transportation. This multitude of tools used in an 
industrial ecology assessment allows an industry or city 
to look at various designs and see how they affect 
inputs, outputs, and the overall effect on the economy. 

Urban Sustainability 

Urban sustainability is a systems-level holistic analysis 
of the sustainability of an urban system, particularly in 
regard to its infrastructure. Urban infrastructure is an 
intricate network of six major components: (1) private 
residences, (2) commercial establishments (including 
churches, schools, etc.), (3) water and wastewater, 
(4) energy, (5) transportation, and (6) land use. 
Urban sustainability also includes socioeconomic 
impacts and the policy drivers that cause the emergent 
properties of urban systems. With the demand for 
basic infrastructure growing with population, cities 
need to plan their infrastructures accordingly. 


Traditionally, urban areas were built with the “big- 
pipe concept,” where the natural ecology was replaced 
with hardscape to provide for the urban infrastructure. 
Nature was restricted to curbside trees, and manicured 
parks and lawns filled with nonnative plants which 
require significantly more water and nutrients to 
thrive than the indigenous varieties. More recently, 
however, the importance of ecosystem services and 
preservation of natural ecology has been realized and 
natural alternatives, such as blue-belts for stormwater 
management, are increasingly being used in place of 
hardscapes. With this method, models are being devel¬ 
oped to evaluate different options and to determine 
what social and economic decisions are needed to 
make them happen. 

Current Infrastructure 

Although current infrastructure may not be optimal, 
a large amount of capital has been invested in it and 
people still depend on it. One option for increasing 
sustainability is to develop a hybrid system of central¬ 
ized and decentralized infrastructure, which increases 
redundancy without increasing the amount of requisite 
resources. Future planning also should guide how sys¬ 
tems interact, and plan accordingly to save resources. 
Such interactions will now be described. 

Water and Energy Nexus 

Potable drinking water treatment provides clean drink¬ 
ing water, and wastewater treatment sanitizes sewage. 
Both use a great deal of energy in acquisition, treat¬ 
ment, and distribution. In 2000, the United States used 
more than 50 billion kWh of energy to attain, treat, and 
distribute drinking water and wastewater [10]. This is 
a huge demand for energy that is expected to continue 
to grow through 2050. The Electric Power Research 
Institute’s (EPRI’s) report on Water and Sustainability 
expects that by the year 2050, this number will reach 
over 75 billion kWh (Fig. 1). Already, water accounts 
for about 4% of the total energy sold in the United 
States. In California, it represents 18% of energy needs. 
And as water demand increases and good quality raw 
water becomes more difficult to find, these shares are 
likely to rise as advanced treatment technologies for 
reclaiming water or desalinating water are much more 
energy intensive. With this in mind, when a city is 
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Energy and Water Interdependence, and Their Implica¬ 
tions for Urban Areas. Figure 1 

Energy needed for drinking water and wastewater 
processes in the United States with data from the year 2000 
and predictions of use until 2050 [10] 

considering their water infrastructure, it also may want 
to keep in mind the amount of energy needed for 
various treatments for water and wastewater. 


French Nuclear Power 


In 2003, France experienced what was then their hottest 
summer on record [15]. The higher temperatures 
increased the demand for air-conditioning and refriger¬ 
ation, which translated into an increased demand for 
electricity. In turn, increased energy production created 
more waste heat, which required more water for 
cooling purposes. Though France had plenty of capac¬ 
ity to produce electricity - the nation obtains ~80% of 
its electricity from nuclear power - the heat wave and 
drought limited the amount of water available for oper¬ 
ation of the nuclear power plants, and the country was 
forced to turn off or reduce production at 17 of its 
nuclear reactors. As a result, France could no longer 
export electricity, which had been sold at €95 per MW- 
h to other EU countries, but needed instead to import 
electricity at a cost of ~€1,000 per MW-h. 

Water can be treated using different methods, and 
with methodology differences there also are differences 
in energy dependency. Surface water tends to be 


Energy and Water Interdependence, and Their Implica¬ 
tions for Urban Areas. Table 1 National average energy 
demand by various water treatment systems [1 0, 11 ] 


Water treatment and collection 

kWh/MGal 

Surface water treatment [11] 

1,422 

Groundwater treatment [10] 

1,824 

Brackish groundwater desalination [11] 

3,900-9,750 

Seawater desalination [11] 

9,780-16,500 


Energy and Water Interdependence, and Their Implica¬ 
tions for Urban Areas. Table 2 National average energy 
demand by various wastewater treatment systems [10] 


Wastewater treatment and collection 

kWh/MGal | 

Trickling filter 

955 

Activated sludge 

1,322 

Advanced treatment without nitrification 

1,541 

Advanced treatment with nitrification 

1,911 


cleanest, and as such requires the least amount of 
energy to collect and treat (Table 1). Surface water is 
not available in all areas though. Groundwater requires 
slightly more energy to treat and seawater desal¬ 
ination can use up to 75 times the amount of energy 
as surface water. 

For treating wastewater, methods include a trickling 
filter, activated sludge, advanced treatment without 
nitrification, and advanced treatment with nitrifica¬ 
tion. The energy demand, shown in Table 2, is lowest 
for the trickling filter and highest for advanced treat¬ 
ment with nitrification. The simplest treatments con¬ 
sume the least amount of energy, but they may not 
always be the most viable option given the quality of 
the water being treated and the requirements for the 
processed water. The gap in energy demand for the 
various approaches is much smaller in the case of 
wastewater than it is for water treatment. 

Finally, power is also needed to distribute water. 
Logically, the farther that water needs to travel and 
the more difficult the water is to access, the more the 
energy that will be required to move it. Distributing 
water from lakes and rivers is the least energy intensive. 
It requires about 1,400 kWh/MGal to deliver [12]. 
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Not all cities have lakes and rivers available to tap, 
however. For them, groundwater is more difficult to 
attain and requires slightly more energy. Lastly, 
discharging wastewater requires between 2,350 and 
3,300 kWh/MGal on average. 

The converse of energy being needed to provide, 
treat, and distribute water is that water is needed to 
produce, convert, and store energy. Water use in energy 
production varies depending on the source of energy. 
In the case of thermoelectric power generation, water is 
required for cooling purposes, while in case of hydro¬ 
electricity, water is used as the source of power. Water is 
also required for petroleum fuel processing, the pri¬ 
mary energy source for transportation. Thus all forms 
of energy require water, which varies in the amount and 
purpose of its use depending on the energy source. 
Thermoelectric power accounts for 39% of freshwater 
withdrawals and 52% of our fresh surface water with¬ 
drawals [13]. 

The consumptive use of water for power generation 
is from evaporative losses. Every day thermoelectric 
power loses an average of 3.3 billion gallons of water 
to evaporation [13]. As one may suspect, certain kinds 
of power generation consume more water than others 
(Table 3) . Accounting for the amount of energy produced 
and the water loss associated with each source, approxi¬ 
mately 2 gal of water per kilowatt hour is lost to evapora¬ 
tion in the United States. Hydropower has an evaporative 
loss of 18.27 gal per kilowatt hour, which is more than 35 
times the evaporative loss of water from coal-based ther¬ 
mal power. This implies that hydropower is feasible only 
in areas with an abundance of water. In contrast, photo- 
voltaics and wind consume much less water. 


Energy and Water Interdependence, and Their Implica¬ 
tions for Urban Areas. Table 3 The evaporative loss of 
water by different energy sources in the United States [14] 


Energy source 

Gal/kWh (evaporative loss) 

Hydro 

18.27 

Nuclear 

0.62 

Coal 

0.49 

Oil 

0.43 

PV solar 

0.030 

Wind 

0.001 


Example: Water and Energy Demand: A Tale of 
Two Cities 


Atlanta and Phoenix are two cities with rapidly growing 
populations. As such, the demand on water and energy 
are rapidly growing. Being that they are in different 
regions, they have different demands for water. Table 4 
describes how these cities are very different in terms of 
demand. For example, Atlanta uses more water indoors, 
but Phoenix uses a significant amount more outdoors. 
With long commutes and poor public transportation, 
Atlanta uses significantly more fuel per person per day 
than Phoenix. Some of the more important differences 
are due largely to Phoenix being a city in an arid region, 
very hot and dry, whereas Atlanta is more humid and 
wet generally. The water consumption for electricity 
production in Phoenix is more than four times that of 
Georgia. The electricity consumption for water supply 
and treatment also is around five times more in Phoenix 
than Georgia. With it being considerably more difficult 
to get water to Phoenix, these numbers are bound to 
become higher. The dependence of water on energy 
and energy on water in Phoenix are even greater than in 
Atlanta. Yet Atlanta has experienced a great deal of 
turmoil with its water resources, as described in the 
previous paragraph. So a strain on the Colorado River 
is subject to cause more damage, all things being the 
same. This table also shows that cities are not the same. 
No solution is right for all cities. 


Beyond the Water and Energy Nexus 

While water and energy are inherently connected, there 
are other connections that are equally important in the 
context of urban infrastructure (Fig. 2). Transportation 
needs energy to fuel vehicles, and the amount of energy 
available affects modes of transportation. Land use 
affects the amount of transportation activity in an 
area, and transportation is needed to made land acces¬ 
sible. The availability of water affects how land is used, 
and the way land is used affects the demand on water 
and the ways water can be treated. Likewise, the avail¬ 
ability of energy affects land and water use, and the way 
the land is used affects the energy demands and the 
kind of energy that can be created. Additionally, water 
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Energy and Water Interdependence, and Their Implications for Urban Areas. Table 4 The relationship between water 
use and electricity use for the cities of Phoenix and Atlanta 


The City of Phoenix The City of Atlanta 


Residential water 
demand 

Indoor (gpcd) 

00 

1999 data in Chap. 4 of 
The Water Environment 
of Cities [37] 

71 

2001 data in The World's Water 
2008-2009: The Biennial Report 
on Freshwater Resources 

Outdoor 

(gpcd) 

110 a 

20 

Power use 

Residential 

electricity, 

kWh/person- 

day 

36 

2005 data [38] 

41 

2005 data 

Brown et al. (2008) 

Fuel, kWh/ 
person-day 

5.5 

12.3 

Water consumption 
for electricity 
production 

gal/kWh 

7.85 

(Arizona) 

National Renewable 
Energy Laboratory 
(2003) 

1.65 

(Georgia) 

Torcellini et al. (2003) 

Electricity 
consumption for 
water supply and 
treatment 

Water, 

kWh/MG 

8,600 

[39] Conveyance, 4600 
Pumping, 2600 

Water treatment, 100 
Water distribution, 1300 

1,700 b 

City of Atlanta Watershed 
Management, [13, 36] 

Wastewater, 

kWh/MG 

10,700 

[39] Wastewater 
collection and 
treatment, 1500 
Reclaimed water, 9200 

1,830 b 


These data demonstrate the difference in use for the two different regions. Information of this sort must be considered when making 
energy and water plans for each city 

a The numbers are modified assuming four people for one household 

b The numbers are estimated based on the water and wastewater production of the City of Atlanta, the electricity use of the Atlanta 
Watershed Management Department (Thomas 2007), and the electricity demand for water supply and wastewater treatment of the South 
Atlantic Region (DOE [13]) 


is also used for extracting fuels and growing biofuels, 
which affects the transportation sector. 

Energy for Transportation 

Energy is needed to power a variety of transport vehi¬ 
cles including airplanes, trains, boats, trucks, and auto¬ 
mobiles. The fuel used in these vehicles may be equally 
diverse and may include gasoline, diesel (both petro 
and bio based), ethanol, methanol, natural gas, elec¬ 
tricity, hydrogen, solar, and even coal and wood 
(though these latter two are nearly negligible in the 
modern vehicle mix). The latest trend is to create 
hybrid vehicles that can utilize two or more fuel sources 
(e.g., gasoline and electricity). As of 2009, the trans¬ 
portation sector accounts for ~28% of the of total 


energy consumption in the United States and 72% of 
the nations petroleum consumption [16]. Transporta¬ 
tion also accounted for 3% of natural gas consumption 
and 12% of renewable energy use, mostly in the form of 
ethanol. Given these significant shares, the effect of 
transportation on energy needs to be considered 
when planning cities or city improvements. 

Transportation, Land Use, and Energy 

As the need for mobility increases, improving existing 
vehicle efficiency will lead to marginal reductions in the 
environmental impacts of transportation. Greater 
improvements, however, may be possible if switching 
to other modes of transportation is considered. In 
a 2007 study of plug-in hybrid electric vehicles, the 
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Energy and Water Interdependence, and Their Implications for Urban Areas. Figure 2 

Water, energy, transportation, and land-use systems are interconnected in urban regions 


Pacific Northwest National Lab [6] found that the 
current US electric power infrastructure could support 
the energy needs of 43-73% of the current light-duty 
vehicle fleet - the lower range resulting from the limit¬ 
ing of vehicle charging only to the 6 PM to 6 AM 
period. The study further estimated that if 73% of the 
US fleet was converted to electric vehicles, the United 
States would no longer need to import oil, and green¬ 
house emissions could be reduced by 27% (even if all 
electricity were generated by coal-fired power plants). 
While such broad-scale analyses are informative, it is 
important to note that complex local relationships 
underlie these bigger developments. 

Bras recently completed an assessment of the energy 
and carbon dioxide emissions from different modes of 
transportation in Atlanta. Many kinds of vehicles and 
fuels were included in the study: conventional gasoline, 
diesel, compressed natural gas (CNG), ethanol 85 
flexible-fuel vehicles (E85 FFV), spark-ignition (SI) 
gasoline hybrid electric vehicle (HEV), diesel HEV, SI 
plug-in hybrid electric vehicle (PHEV), diesel PHEV, 
electric vehicle, diesel vehicle, MARTA Clean Diesel 
bus, MARTA CNG bus, and MARTA rail. (MARTA is 
the Metropolitan Atlanta Rapid Transit Authority, 
which is the primary form of public transportation 


in Atlanta.) All of these transportation methods 
were evaluated on a Well-To-PUMP (WTP) and 
Pump-To-Wheels (PTW) basis for energy use and car¬ 
bon dioxide emissions (see Figs. 3 and 4). Due to low 
ridership, MARTAs buses and rail are among the worst 
for both carbon dioxide emissions per passenger dis¬ 
tance and for energy use per passenger distance. 
Increasing ridership would greatly improve these met¬ 
rics. Other results suggest that electric vehicles do not 
fare much better than conventional gasoline and diesel. 
This is due, in part, to Georgias dependence on older 
inefficient coal-fired power plants for energy. Overall, 
plug-in hybrid electric vehicles used the least energy 
and emitted the least carbon dioxide per passenger 
distance. 

The implications of the effect of ridership on 
energy use and carbon dioxide emissions from various 
transportation modes in Atlanta suggest that other 
variables beyond just vehicles and fuel are also impor¬ 
tant. Planning and developing cities for growth with 
compact land use and transportation could signifi¬ 
cantly improve their sustainability and lower their 
dependence on energy. The way land is used affects 
the modes of transportation available and their effi¬ 
ciencies. For example, a compact city favors walking, 
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Energy and Water Interdependence, and Their Implications for Urban Areas. Figure 3 

The energy use per passenger distance for various transportation modes in Atlanta, GA 



E 


Energy and Water Interdependence, and Their Implications for Urban Areas. Figure 4 

Carbon dioxide emissions per passenger distance for various transportation modes in Atlanta, GA 


heavy rail, and other modes of efficient public trans¬ 
portation. New York City is a good example of this 
kind of city, where parking fees, gas expenses, and 
traffic lead the majority of the population to walk, 


take the subway, or ride the bus. An opposing example 
might be Atlanta, where the city is not compact 
enough to offer many people the option to walk or 
use public transportation. 
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Water for Transportation 

Along with energy and land use, water is also 
intertwined with transportation. Department of 
Energy data [13], along with data from Harto’s LCA 
of low carbon fuels [17], shows that oil consumes 
between 0.03 and 0.08 gal of water per kilowatt hour 
of energy [13] (see Table 5). Natural gas production 
uses a similar amount. Water needed to produce bio¬ 
diesel from irrigated soy, though, is in the range of 
0.39-9 gal of water per kilowatt hour. Corn-ethanol 
needs up to 19 gal of water per kilowatt hour, or 
~500 times more than that for oil. From another per¬ 
spective, Webber [12] found that a traditional gasoline 
car consumes around 7-14 gal of water per 100 miles 
traveled, PHEVs consume ~24 gal per 100 miles, and 
ethanol fueled vehicles require 130-6,200 gal per 100 
miles. 

The Compounding Power of Density 

Urban density reaps a great many sustainable benefits. 
Public transportation is more accessible and commut¬ 
ing distances are generally shorter. Compact living 
also means that buildings can be closer to the 


Energy and Water Interdependence, and Their Implica¬ 
tions for Urban Areas. Table 5 The dependence of vari¬ 
ous transportation fuels on water 


Unit: Gal/kWh 

Low 

High 


Coal [13] 

0.007 

0.027 

Mining+washing 

Petroleum/oil [13] 

0.03 

0.076 

Extraction 

+refining 

Natural gas [13] 

0.01 

0.01 

Extraction 

+processing 

Corn-ethanol [17] 

1.26 

19 

Assuming 15% 
irrigation for USA 

Cellulosic 
ethanol [17] 

0.13 

0.431 

No irrigation 

Cellulosic 
ethanol [17] 

16 

19 

Irrigation 

Soy-biodiesel [17] 

0.392 

8.98 

Assuming 4% 
irrigation for USA 

Algae biodiesel [17] 

0.839 

1.762 

Enclosed 

Algae biodiesel [17] 

0.895 

18.351 

Open 


infrastructures that provide them water and energy, 
which means less energy and materials are needed for 
distribution. It is also often the case that the more 
complex power and water networks that serve dense 
urban areas have built in redundancies that can com¬ 
pensate for component failures within the system. 
Finally, density affords other cross-pollinating oppor¬ 
tunities such as combined heat and power, and 
low-impact development water infrastructure that 
also provides green space for recreation. 

Jobs, Quality of Life, and Tax Revenue 

The nexus of infrastructure extends beyond the inter¬ 
connection of the physical, functional, and environ¬ 
mental constituent pieces. It also has implications for 
a place’s social and economic well-being. Consider 
again, for example, the benefits of compact urban 
design. Land consumed by large private residential 
lots cannot be used by the collective community. 
Small private lots, on the other hand, preserve the 
opportunity to leave larger tracts available for recrea¬ 
tion, wildlife habitat, and other amenities that are not 
possible when the land is fragmented. These public 
conveniences and desired attributes improve the qual¬ 
ity of life for everyone, create a more favorable, stable, 
and job-producing business climate, and raise property 
values. And when the resultant increase in tax revenues 
is reinvested in the infrastructure, an engine of pros¬ 
perity is created (see Fig. 5). 

Steps Toward a More Sustainable Future 

Water 

Rain water is a source of clean water that does not need 
much treatment. However, most cities currently do 
not handle this resource efficiently. In most cases, 
stormwater becomes contaminated when it falls onto 
urban or agricultural surfaces, or is shunted into the 
wastewater system. Collection and treatment of 
stormwater using low-impact development (LID) tech¬ 
niques offers a myriad of benefits. The first step is to 
separate the collection of stormwater from wastewater. 
One immediate benefit is that with less water to be 
treated, less energy is needed in the water treatment 
process. Reduced flow into the wastewater stream also 
reduces the risk of sewage overflow into surface waters. 
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Energy and Water Interdependence, and Their Implications for Urban Areas. Figure 5 

The relationship between improved infrastructure and an improved socioeconomic environment 


E 


The stormwater that is collected can be used to create 
green spaces, to flush toilets, for cooling, or for fire 
fighting. Below is a sample of LID techniques that can 
be used. 


Vancouver 


The City of Vancouver was concerned about the 
stormwater and sewage that was being discharged 
into its salmon-bearing rivers [18]. The city considered 
traditional separation and decentralized treatment, but 
after discovering that putting in a centralized 
stormwater treatment facility would cost the city an 
estimated $4 billion, it decided to look into different 
options. One approach that employed low-impact 
development and decentralized treatment satisfied 
the water treatment needs while also creating green 
space which increased property values and led to an 
estimated increase of $400 million in tax revenue. It also 
resolved the environmental problem for their salmon¬ 
bearing rivers, improved conditions for other wildlife, 
and reduced the urban heat-island effect. 


Best Management Practices for Pollution Control 

Best Management Practices (BMPs) are techniques 
used in controlling water pollution and the flow of 
water. They are put in place to control pesticides, 


herbicides, nutrients, other existing pollutants, and 
emerging contaminants like pharmaceuticals and per¬ 
sonal care products from moving from the land to the 
water. Some of these techniques may simply reduce the 
amount of toxins used in a process or pesticides used 
for farming. Some structural techniques include deten¬ 
tion basins, which are used to prevent flooding in 
storms, and pervious pavement, which allows water 
to flow through its pores while holding a number of 
pollutants in the soil. These techniques help manage 
the flow of stormwater so that wastewater systems are 
less likely to be overwhelmed. They often rely on 
natural techniques, such as bioremediation or wet 
land treatment, to clean the water and are often less 
expensive and less energy intensive alternatives. 


New York 


The Safe Drinking Water Act required New York City to 
build new water filtration plants at a cost of ~$8 billion. 
Balking at the cost, the city instead acquired 
undeveloped lands to buffer the watershed, initiated 
a host of Best Management Practices (BMPs), worked 
with the farming community to develop a Whole Farm 
Plan to reduce runoff pollution, used LID techniques to 
reduce stormwater flow, and made a few upgrades to 
existing sewer and septic systems in the region. In the 
end, these actions, in lieu of the $8 billion cost of new 
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filtration plants, enhanced the economic productivity of 
the land from recreation, farming, and the protection of 
the watershed from runoff [19]. 


Green Roofs Green roofs are roofs that are partially 
or wholly covered with vegetation. They increase urban 
green space, reduce stormwater runoff, sequester car¬ 
bon dioxide directly, and reduce building energy con¬ 
sumption. They can be added to buildings in two ways, 
intensive or extensive. An intensive roof is normally 
between 8 and 12 in. thick and weighs 80-120 lb per 
square foot [20]. Extensive roofs are thinner and much 
lighter (10-50 lb per square foot). Intensive roofs are 
often built as gardens, but must be well supported due 
to the increased weight. Extensive roofs are not neces¬ 
sarily meant to be visited, but may be used for some of 
the same benefits, but at a lower cost. 

Indigenous Plants Plants that are native to a region 
require less water, fertilizers, and pesticides than exotic 
plants. Indigenous plants have adapted to local condi¬ 
tions. Less effort is needed to maintain these plants, 
and they live longer than other plants would in their 
environment. Alternatively, plants with superior car¬ 
bon sequestration qualities like yellow poplar and 
American sweet gums more effectively reduce the 
carbon footprint of a city relative to slower growing 
hardwood trees. 

Pervious Pavement Pervious pavement allows water 
to pass through the pavement for groundwater 
recharge and thereby decreases the amount of rainfall 
that enters the wastewater system. Pervious pavement 
is ideal for large, low traffic uses like parking lots, 
driveways, bike paths, and sidewalks. Not only does 
pervious pavement reduce the load of stormwater 
on wastewater treatment systems, but it also reduces 
nutrient and metal loads from runoff (by 80% and 
90%, respectively) [21]. 

LID techniques should be accessed city by city 
and based on a cost-to-benefit analysis, as not all tech¬ 
niques are proper for each city circumstance. But 
while most LID techniques are beneficial even within 
traditional centralized water systems, decentralized 


systems may offer additional benefits. With conveyance 
and distribution responsible for 80% of the energy 
demand in the water sector [10], decentralized systems 
can offer significant savings. In addition, due to the 
lower residence time of the water in the distribution 
system, the final water quality at the tap is less 
compromised. Decentralized systems also add resil¬ 
ience to the system due to higher redundancies than 
in centralized systems. That is, if a centralized system 
fails, all the customers that it serves are at risk of 
being disconnected from their water supply. When 
a decentralized system fails, another nearby station 
can supply the water needed until the damaged plant 
is fixed. 

While newly developing areas may implement 
decentralized water systems, other cities with existing 
centralized water treatment systems should not look 
to totally disassemble their infrastructure. Instead 
they should consider developing a hybrid central¬ 
ized/decentralized model. This might be as simple 
as adding a decentralized plant in a quickly growing 
area instead of expanding the centralized plant. 
And because LID techniques such as rainwater 
harvesting and low flow toilets reduce the output 
needed from a water treatment plant, LID techniques 
and decentralization simultaneously work to enable 
and enhance each other. 

Energy 

Lossil fuels are a limited resource. As such, their use 
needs to be judicious and efficient. Improving the 
efficiency of vehicles, lighting, appliances, electronics, 
heating, ventilation, and cooling is one strategy. 
Another strategy may be redesigning the energy system 
by distributing power and incorporating renewables, 
and truly integrating multiple resource planning. 
Below are described a sample of opportunities. 

Combined Heat and Power A large amount of 
energy is lost in creating electric power; only about 
30% of the energy input results in electricity delivered 
to the customer [23]. Seventy percent of the energy is 
lost in converting the primary fuel into electricity and 
distributing through the power grid. Lor that same 
amount of input energy, a combined heat and power 
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Energy and Water Interdependence, and Their Implications for Urban Areas. Figure 6 

Utilization of energy for separate electric power and combined heat and power. The amount of energy lost is much higher 
in the separate system due to the recycling of heat to make more energy (Amended from [22]) 


system could utilize ^85% of the input energy 
(see Fig. 6). Combined heat and power has the poten¬ 
tial to provide the United States 20% of the electricity 
by 2030, which could reduce an estimated 0.2 gigatons 
of carbon annually. This system also emits, on average, 
1/ 10th of the nitrogen oxides (NO x ) per kWh as com¬ 
pared to the average grid electricity [24]. Helsinki 
Energy in Finland used combined heat and power, 
and has experienced great results [24]. Sulfur dioxide 
emissions from district heating were reduced around 
90% from 1980 to 2004. This scheme also saved Fin¬ 
land six million tons of carbon emissions in 2004. 
Using less energy means fewer greenhouse gases are 
created, and with the climate changing so quickly, 
emission reduction is needed. Systems like these gen¬ 
erate electricity and use the waste heat for a variety of 
processes like chemical manufacturing, heating build¬ 
ings, or producing hot water, and thereby eliminate or 


reduce the energy and consequent environmental 
impacts needed for those demands [23, 24]. 

Energy Efficient Products In most existing applica¬ 
tions, simply upgrading or modernizing energy 
demanding products will result in improved efficien¬ 
cies and higher performance. Even low-end refrigera¬ 
tors produced today may be several times more 
efficient than refrigerators produced 20, 30, and 
40 years ago, while also providing more features and 
functionality along with a relatively short payback 
period. When choosing new or replacement products, 
however, certified Energy Star products use 20-30% 
less energy than the federal standards require [25]. 
Eligible products include appliances, electronics, 
heating and cooling systems, lighting, fans, plumbing 
fixtures, and building products like insulation and 
roofing. 
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Energy and Water Interdependence, and Their Implica¬ 
tions for Urban Areas. Figure 7 

Price of a PV module per Watt peak (constant year 2000 US 
dollars) as a function of cumulative installation of PV in 
megawatts for the period 1976-2010 [27] 


Photovoltaics and Other Renewable Energy Sources 

Every day the sun provides more than enough energy to 
meet all the demands of humans and nature. But while 
nature has evolved to efficiently harvest this free energy, 
human made systems are only now becoming cost- 
effective (see Fig. 7). Comparing photovoltaic power 
generation to coal generation, Table 6 shows for each 
source the ratio of energy output to energy invested 
water use, land use, cost of power, and jobs. While the 
lower overnight cost of power still favors coal, the other 
attributes indicate that PV is already a competitive 
alternative. With costs continuing to drop, however, 
PV could soon become the preferred power source 
with coal becoming the less attractive alternative. 

Wind, geothermal, biomass, and hydroelectricity 
are some of the other renewable energies that are grow¬ 
ing in importance. For example, a recent working 
report suggested that there is an unrealized potential 
of 29,400 MW of hydroelectricity that could be 
developed in the United States [26]. One problem 
with renewable resources, though, is their inherent 
intermittency. In order for many renewables to become 
practical, large-scale inexpensive energy storage 
methods need to be developed. These methods may 


Energy and Water Interdependence, and Their Implica¬ 
tions for Urban Areas. Table 6 A comparison of power 
generation from photovoltaic and coal energy sources 


Energy Source 

Photovoltaic 

Coal 

Energy output r-)oi 

Energy invested 

9.0 

5.0 

Water use [29] 

0.001 Gal/kWh 

0.49 Gal/kWh 

Land use 

0.51 kWh/acre 
(RET Screen 
Simulation) 

690 kWh/acre 
[30-33] 

Overnight cost [34] 

$4.75/kWh 

$2.84/kWh 

Total job-years/GW 
h (avg) [32] 

0.87 

0.11 


include various types or combinations of electrochem¬ 
ical batteries, flywheels, compressed air, superconduc¬ 
tive magnetic storage, and the oldest, the least capacity- 
constrained and most popular when available, pumped 
hydro-storage. 

Decentralizing Distribution Apart from the manner 
in which power is generated, system resiliency can be 
increased by changing how the power is distributed. 
The smaller scales inherent to renewable generators, 
such as PV, mean that these generators can more readily 
be located in various locations across the distribution 
feeder. Microgrids can be created by separating the 
distribution feeders into zones of balanced generation 
and load can by placing reclosers at the zone bound¬ 
aries (reclosers are similar to circuit breakers, but do 
not have the capability to interrupt the fault current; 
see Fig. 8, e.g.). In the case of a feeder separated from 
one of its supply substations by a fault location, such 
a network may allow the feeder still to be energized and 
supplied by other generators while the faults are being 
mitigated. That would improve reliability as measured 
by indices such as SAIFI and SAIDI (system average 
interruption frequency and duration index), CAIFI and 
CAIDI (customer average interruption frequency and 
duration index), and ASUI and ASAI (average service 
(un)availability index). Improvements in such indices 
(or composite indices) usually mean better customer 
satisfaction and diminished losses associated with 
repetitive outages, that is, increased resiliency. 
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Energy and Water Interdependence, and Their Implications for Urban Areas. Figure 8 

A depiction of an islanded distribution network. This system is split into multiple zones with balanced distributed 
generation and loads (as shown by the dotted lines and multiple colors). There are multiple distributed generators, and 
each island has a recloser at the boundaries with other zones. This allows the islands to cut off from each other in case of 
a disturbance, keeping the larger portion of the network supplied with energy. Once the problem is fixed, the islanded 
system can then automatically be reconnected to the grid (transmission network) 


Example: Distributed Renewable PV Generation 


Distributed generation consists of small-scale and 
decentralized electric energy systems. Capacities vary, 
typically in the range of several kW to hundreds of MW. 
Accurate and efficient system analysis algorithms are 


needed in order to analyze the impact of PV systems on 
various types of microgrids and distribution networks. 
The influence of uncertainties can be modeled via suit¬ 
ably optimized Monte Carlo techniques. 

Commercial PV power industries started developing 
in the 1970s. In spite of a 70% reduction of the real price 
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of PV modules over 40 years, energy from PV remains 
too expensive to compete with conventional sources. 
Political changes in the United States in the early 1980s 
ended substantial funding for solar energy research 
and, since the nation represented nearly 80% of the 
global market for solar energy at that time, virtually 
halted solar energy development around the world. In 
2004, renewable energy sources accounted for 9.6% of 
the electricity generated in the United States and 18.6% 
worldwide. China was the leading nation by renewable 
generation capacity in 2008 (598 TW h of renewable 
energy produced that year). A total of 3,584 TW h of 
renewable energy was produced in the world in 2008, 
of that only 12 TW h was from solar photovoltaic gen¬ 
erators while, for example, 16% (3,288 TW h) of world 
electric energy was produced from hydroelectric plants. 
However, PV has become one of the most rapidly grow¬ 
ing energy generation technologies. PV module ship¬ 
ments have grown at an average annual rate of 40% 
since 1996, up from 13% in the previous decade. The 
potential growth may transform PV into a $100 billion 
industry [35]. 

On the technical side of PV proliferation, electric 
power utilities are not motivated to allow interconnec¬ 
tions of customer-owned generators (such as many PV 
installations would be) to their distribution networks 
(mostly for operational reasons). Utilities tend to put 
nonutility generation under the extensive technical 
analysis. Conversely, the regulating authorities tend to 
act in favor of DG owners and support that the inter¬ 
connection be as easy and transparent as possible. 

The main objective of PV installation is to boost the 
energy savings. Figure 9 shows annual estimated 
monthly PV generation output of an assumed PV sys¬ 
tem in the Atlanta area in the United States. Figure 10 
presents how the total load demand is decreased due 
to the PV generation with capacity ranging 10-40% of 
the annual peak load at the same feeder. The figure 
represents the system performance on the 244th day of 
the year. If the distributed generator (DG) is owned by 
the customer, the actual benefit is cost saving in the 
electric bill. If the DG is owned by the utility, the benefit 
is avoided energy production from the less desirable or 
more expensive sources and reduction in transmission 
and distribution losses. This benefit means energy sav¬ 
ing in both cases. 


The second impact of solar electricity generation is 
the ecological impact. The PV output energy savings 
provide the opportunity to create the avoided carbon 
footprint by not using the more ecologically impactful 
technologies (coal, natural gas, etc.). In addition to 
reducing the effective feeder load in the distribution 
networks where it is installed (thus also reducing the 
transmission and distribution losses), PV generation can 
also bring about C0 2 reduction. It can also cause water 
footprint reduction through avoided thermoelectric 
and hydroelectric generation, and fuel reduction, such 
as coal, petroleum, and natural gas in the thermal 
power generation as in Table 6. The example assumes 
that the PV system is optimally oriented to maximize 
energy production. (If shaving of peak load levels is the 
main objective, it is possible to orient the PV system 
more westward, thereby boosting the PV output at the 
peak load times.) 


Understanding Aggregate Demand A number of 
factors need to be considered in every city when mak¬ 
ing decisions to improve sustainability. Inputting all 
the data necessary to run a good model of a city is 
a massive undertaking, but the model could be a very 
useful tool. 

For the modeling of Atlanta, Geographic Informa¬ 
tion System (GIS) tools were used to input infrastruc¬ 
ture data. The What if?™ system was used to model 
Atlanta. This software was used to show the suitability 
of various infrastructures for an area. Figure 11 shows 
the suitability for eight different land-use types. As 
Atlanta continues to develop, this can be used as 
a tool to plan infrastructures for areas that are more 
suitable. Along with suitability factors, this program 
was used to predict the land use of Atlanta as it grows. 
Figure 12 shows Atlanta’s use of land in 2010 and the 
prediction for land use in 2030, using the Business as 
Usual scenario. This scenario looks to grow the city in 
the traditional urban sprawl way, and the figure shows 
much of the land use going to residential use and 
employment centers expanding outward from the city 
center. Figure 13 offers a different comparison, this one 
of two different growth scenarios: Business as Usual 
and Compact Growth as predicted for 2030. 
This comparison shows the difference in how Atlanta 
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Energy and Water Interdependence, and Their Implications for Urban Areas. Figure 9 

Annual PV generation and load demand on a typical distribution feeder. It is assumed that the total PV capacity is equal to 
40% of the peak load demand 
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Energy and Water Interdependence, and Their Implications for Urban Areas. Figure 10 

The effect of PV generation on a typical day of the year (horizontal axis represents time of the day). Assumed PV capacity is 
40% of the annual peak load 
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Energy and Water Interdependence, and Their Implications for Urban Areas. Figure 11 

The maps show the 13 county metropolitan Atlanta area. Based on the land, the program What if?™ developed the 
suitability for different kinds of infrastructure and land use 
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Energy and Water Interdependence, and Their Implications for Urban Areas. Figure 12 

The 13 county metropolitan Atlanta area's land use. This shows the prospective land use using the Business as 
Usual scenario from the What if?™ software. Employment centers and residential areas expand outward over the 20 years, 
leaving very little area undeveloped 


can grow, with more compact living and more 
undeveloped area, or with continued urban sprawl. 

Programs such as What if?™ and UrbanSim give 
a rough prediction of the future, but the scenarios in 
these programs are not easily edited by the user. To 
date, infrastructure planning has been an exercise in 


developing a few different scenarios that fit the general 
tendencies of the agents and groups of agents that the 
infrastructure intends to serve. This approach, 
though, fails to account for a number of factors that 
affect the decision making of agents. Looking beyond 
a set of scenarios, agent-based modeling creates 
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Energy and Water Interdependence, and Their Implications for Urban Areas. Figure 13 

The 13 county metropolitan Atlanta area's prospective land use in 2030. This is according to the simulation run with 
the What if?™ software. These show the comparison of development based on two different scenarios. Business as Usual 
and Compact Growth. Compact Growth shows that much more land is left undeveloped and residences remain near 
the city, whereas Business as Usual shows an increase in urban sprawl 


equations for how people react to different constraints 
and opportunities. For example, surveys and 
behavioral analyses can be used to develop algorithms 
that describe how people react to policy decisions 
or to gauge consumer awareness. These reactions 
though are tempered by the context of many other 
variables though such as economic feasibility, the state 
of current infrastructure, or the availability of tech¬ 
nology. Modeling could provide a number of answers 
to questions concerning a city’s sustainability. 
Some of the questions it could help address are as 
follows: 

• What price are people willing to pay for renewable 
energies, and what kind of policies may be needed 
to encourage this choice? 

• How can the transportation network be better 
planned in order to make public transportation 
efficient, and at what price will people pay to choose 
more sustainable forms of transportation? 

• What incentives are needed for people to live in 
more compact spaces? 

• How can consumption of nonrenewable resources 
be reduced? 

Agent-based modeling, in combination with other 
modeling tools, is needed to predict the demand for 
urban infrastructure, how it should be built and 
designed, and what materials should be used. 


Future Directions 

For the future, there is a need to redesign our 
anthroposphere (the place where we live), so that it can 
exist within the means of nature: using only the renewable 
resources that nature can sustainably provide, while gen¬ 
erating only wastes that nature can sustainably assimilate. 
Future goals include: 

• Monitor, model, visualize, and predict the emergent 
properties of urban infrastructure systems and their 
resilience to stressors. 

• Understand the flow of resources and information 
as they move through the urban system with urban 
metabolism. 

• Develop a pedagogy for the design of complex 
systems for sustainability within the context of 
urban systems. 

• Integrate the human perspective into urban infra¬ 
structure so that socially sustainable policies and 
outcomes are produced. 

• Research and develop sustainable alternative tech¬ 
nologies for water treatment, and materials and 
energy production. 
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Glossary 

Battery A cell in which electrical energy is stored and 
discharged via Faradaic electrochemical reactions 
occurring at a predetermined voltage dependent 
on the active materials in the two electrodes. 

Control (power split) strategy The strategy used in 
a hybrid-electric vehicle to determine the fraction 
of the power demanded that is provided by the 
electric motor and engine, respectively. 

Hybrid-electric vehicle (HEV) A vehicle having both 
an electric motor and internal combustion engine 
in which the battery is maintained near a fixed 
state-of-charge via electrical energy generated by 
the electric motor/generator driven by the engine. 

Plug-in hybrid electric vehicle (PHEV) A vehicle 
having both an electric motor and internal com¬ 
bustion engine in which the battery is depleted by 
all-electric driving and recharged from the wall 
plug. 

Pulse power efficiency The fraction of the discharge 
or charge energy from the battery or ultracapacitor 
during a short, high-power pulse that is available as 
electrical energy for use by the vehicle. 

Ultracapacitor (electrochemical capacitor) A device 
in which electrical energy is stored and discharged 
via the electrostatic separation of charge in the 
micropores of activated carbon in two electrodes. 

Vehicle simulations Computer calculations of the 
operation of a vehicle based on models of the var¬ 
ious components in the driveline of the vehicle and 
a specified control strategy for its operation. 
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Definition 

Most hybrid vehicles use batteries for energy storage. 
The batteries must provide both the energy and power 
required by the vehicle. In all cases, this requires com¬ 
promises in designing the battery in that it cannot be 
optimized for either energy density or power capability. 
In addition, in most cases the battery unit is oversized 
(stores more energy than is used by the vehicle) in 
order to achieve long cycle life even for the shallow 
cycles experienced in charge-sustaining hybrid vehicles 
like the Toyota Prius. As discussed in this entry, another 
approach to providing energy storage for hybrid vehi¬ 
cles, both charge-sustaining and plug-in designs, is to 
utilize ultracapacitors either alone or in combination 
with batteries. In this approach, the batteries can be 
optimized for energy density and cycle life and 
ultracapacitors can provide the power both for accel¬ 
eration and regenerative braking. The ultracapacitors, 
which are inherently high-power devices, would be 
deep discharged to one-half rated cell voltage and still 
provide cycle life of 500,000 to one million cycles. 

The development of ultracapacitors suitable for use 
in vehicle powertrains started in about 1990 and steady 
progress in improving the performance of the devices 
has occurred since that time. This entry is concerned 
with the present status of commercially available cells 
and modules and projections of future developments 
based on the performance of prototype cells of both the 
carbon/carbon and hybrid carbon/pseudo-capacitive 
designs. The cost of ultracapacitors is currently the 
primary obstacle to the use of ultracapacitors in vehicle 
applications. In this entry, the future costs are projected 
and compared with high-power batteries based on the 
ultracapacitor material costs. Finally, applications of 
ultracapacitors in hybrid-electric vehicle powertrains 
are discussed. Key considerations are the relationships 
between the Wh stored in the ultracapacitor unit, sys¬ 
tem control strategy, and the fuel economy improve¬ 
ments attained for various driving cycles. These 
relationships are studied using detailed vehicle simula¬ 
tion results from Advisor. 

Introduction 

In the first section of the entry, the energy storage 
characteristics of lithium batteries and ultracapacitors 
are presented and compared. Of particular interest is 


the comparative power capability of lithium batteries 
and carbon/carbon ultracapacitors for charge/dis- 
charge conditions to be encountered in hybrid-electric 
vehicles. In the next section, system considerations and 
advantages of combining ultracapacitors and batteries 
for energy storage are discussed. Next laboratory and 
field experiences with ultracapacitors in hybrid-electric 
vehicles are reviewed for transit bus, light-duty, and 
fuel cell applications. 

The final sections of the entry discuss in detail 
simulations of vehicles on various driving cycles using 
ultracapacitors alone and in combination with 
advanced batteries. The vehicle simulations treat 
charge-sustaining micro- and mild-hybrid designs, 
plug-in hybrids, and fuel cell-powered vehicles. The 
charge-sustaining hybrid vehicles using ultracapacitors 
alone in place of batteries are discussed in terms of the 
sizing of the ultracapacitor unit and control strategies 
that optimize the fuel economy improvements achiev¬ 
able. For the plug-in hybrids, the emphasis is on the use 
of ultracapacitors with advanced batteries having 
energy densities greater than 200 Wh/kg. It is likely 
that these batteries will not have proportionally high 
power density and will require the assist of 
ultracapacitors to achieve all-electric operation even 
on the FUDS driving cycle. The next section of the 
entry deals with fuel cell-powered vehicles and con¬ 
siders the use of ultracapacitors to load level the fuel 
cell to increase system efficiency and thus improve 
hydrogen consumption. In the final section of the 
entry, the cost of batteries and ultracapacitors are com¬ 
pared for charge-sustaining hybrid applications. 

Energy Storage Characteristics of Ultracapacitors 
and Batteries 

Lithium batteries and ultracapacitors have been tested 
in the laboratory at the University of California, Davis 
[1-3]. A summary of the test results for the batteries is 
given in Table 1 and for ultracapacitors in Table 2. 
For both energy storage technologies, the devices 
with the highest energy density typically have the low¬ 
est power capability. The pulse power capabilities 
shown in the tables were calculated using the following 
relationships: 

Batteries: P = EF(1 — EF) V oc 2 /R 
Ultracapacitors: P = 9/16(1 — EF) V rate d 2 /R 
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Energy Storage: Ultracapacitor. Table 1 Summary of the performance characteristics of lithium-ion batteries of various 
chemistries 


Battery developer/ 
cell type 

Electrode 

chemistry 

Voltage 

range 

Ah 

Resist. 

(mOhm) 

Wh/kg 

W/kg 

90% effic. 3 

W/kg 

match, imped. 

Wt. (kg) 

Density 

(gm/cm 3 ) 

Enerdel HEV 

Graphite/ 

Ni Mn02 

4.1-2.5 

15 

1.4 

115 

2,010 

6,420 

0.445 

- 

Enerdel EV/PHEV 

Graphite/ 

Ni Mn02 

4.1-2.5 

15 

2.7 

127 

1,076 

3,494 

0.424 

- 

Kokam prismatic 

Graphite/ 

NiCoMn02 

4.1-3.2 

30 

1.5 

140 

1,220 

3,388 

0.787 

2.4 

Saft cylind. 

Graphite/ 

NiCoAl 

4.0-2.5 

6.5 

3.2 

63 

1,225 

3,571 

0.35 

2.1 

GAIA cylind. 

Graphite/ 

4.1-2.5 

40 

0.48 

96 

2,063 

5,446 

1.53 

3.22 


NiCoMn02 


7 

3.6 

78 

1,310 

3,472 

0.32 

- 

A123 cylind. 

Graphite/ 

iron 

phosph. 

3.6-2.0 

2.2 

12 

90 

1,393 

3,857 

0.07 

2.2 

Altairnano 

prismatic 

LiTiO/ 

NiMn02 

2.8-1.5 

11 

2.2 

70 

990 

2,620 

0.34 

1.83 

Altairnano 

prismatic 

LiTiO/ 

NiMn02 

2.8-1.5 

3.8 

1.15 

35 

2,460 

6,555 

0.26 

1.91 

Quallion cylind. 

Graphite/ 

NiCo 

42-2.7 

1.8 

60 

144 

577 

1,550 

0.043 

2.6 

Quallion cylind. 

Graphite/ 

NiCo 

42-2.7 

2.3 

72 

170 

445 

1,182 

0.047 

2.8 

EIG prismatic 

Graphite/ 

NiCoMn02 

4.2-3.0 

20 

3.1 

165 

1,278 

3,147 

0.41 

- 

EIG prismatic 

Graphite/ 

iron 

phosph. 

3.65-2.0 

15 

2.5 

113 

1,100 

3,085 

0.42 


Panasonic EV 
prismatic 

Ni metal 
hydride 

72-5.4 

6.5 

11.4 

46 

395 

1,093 

1.04 

1.8 


a power density P=Eff.*(1 -Eff.) Voc 2 /R, P mat ch. imped. = V 2 /4R 


where EF is the efficiency of the pulse (EF = V pu i se /V oc ). 
The matched impedance power which is often cited 
for both battery and ultracapacitor devices is calculated 
as follows: 

Pmatchimped. = V /4R 

These power relationships are derived in [4]. 

For charge-sustaining hybrids, it seems reasonable 
to cite the power capability of devices for pulse power 
efficiencies of 90-95%. For PHEVs and EVs operating 


in charge-depleting modes for the battery, it is reason¬ 
able to cite the power for efficiencies of 75-80%. In all 
instances, the power capability is proportional to V 2 /R 
indicating that high power capability requires a low 
resistance R. 

Comparisons of the Power Capabilities of Lithium 
Batteries and Ultracapacitors 

There has been considerable discussion in the literature 
[4-6] comparing the power capability of lithium-ion 
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Energy Storage: Ultracapacitor. Table 2 Summary of ultracapacitor device characteristics 


Device 

V rated 

C (F) 

R (mOhm) 

RC (s) 

Wh/kg 

(1) 

W/kg 
(95%) (2) 

W/kg 

match, imped. 

Wt. (kg) 

Vol. (1) 

Maxwell 3 

2.7 

2,885 

0.375 

1.08 

4.2 

994 

8,836 

0.55 

0.414 

Maxwell 

2.7 

605 

0.90 

0.55 

2.35 

1,139 

9,597 

0.20 

0.211 

Skeleton technologies 

2.8 

1,600 

1.3 

2.1 

5.8 

800 

7,140 

0.22 

0.13 

Yunasko b 

2.7 

55 

4 

0.22 

5.5 

5,695 

50,625 

0.009 

- 

Yunasko b 

2.7 

450 

1.3 

0.58 

5.89 

2,766 

24,595 

0.057 

0.045 

Yunasko b 

2.7 

510 

0.9 

0.46 

5.0 

2,919 

25,962 

0.078 

0.055 

Ness 

2.7 

1,800 

0.55 

1.00 

3.6 

975 

8,674 

0.38 

0.277 

Ness 

2.7 

3,640 

0.30 

1.10 

4.2 

928 

8,010 

0.65 

0.514 

Ness (cyl.) 

2.7 

3,160 

0.4 

1.26 

4.4 

982 

8,728 

0.522 

0.379 

Asahi glass (propylene 
carbonate) 

2.7 

1,375 

2.5 

3.4 

4.9 

390 

3,471 

0.210 

(estimated) 

0.151 

Panasonic (propylene 
carbonate) 

2.5 

1,200 

1.0 

1.2 

2.3 

514 

4,596 

0.34 

0.245 

EPCOS 

2.7 

3,400 

0.45 

1.5 

4.3 

760 

6,750 

0.60 

0.48 

LS Cable 

2.8 

3,200 

0.25 

0.80 

3.7 

1,400 

12,400 

0.63 

0.47 

BatScap 

2.7 

2,680 

0.20 

0.54 

4.2 

2,050 

18,225 

0.50 

0.572 

Power sys. (activated 
carbon, propylene 
carbonate) 6 

2.7 

1,350 

1.5 

2.0 

4.9 

650 

5,785 

0.21 

0.151 

Power sys. (graphitic 

3.3 

1,800 

3.0 

5.4 

8.0 

486 

4,320 

0.21 

0.15 

carbon, propylene 
carbonate) 6 

3.3 

1,500 

1.7 

2.5 

6.0 

776 

6,903 

0.23 

0.15 

JSR micro (AC/graphitic 

3.8 

1,000 

4 

4 

11.2 

900 

7,987 

0.113 

0.073 

carbon) 


2,000 

1.9 

3.8 

12.1 

1,038 

9,223 

0.206 

0.132 


(1) Energy density at 400 W/kg constant power, Vrated - 1/2 Vrated 

(2) Power based on P=9/16*(1 -EF)*V2/R, EF=efficiency of discharge 
a Except where noted, all the devices use acetonitrile as the electrolyte 
b Except for these, all devices are packaged in metal containers 


batteries and ultracapacitors. The conclusions vary 
from statements that lithium batteries have power 
capability equal to that of ultracapacitors to statements 
that ultracapacitors have an order of magnitude higher 
power capability than lithium batteries. Detailed com¬ 
parisons of the power capability of ultracapacitors and 
batteries taken from [4] are shown in Table 3. As 
indicated in the table, neither of the extreme statements 
is valid in general and that comparisons should be 
made between specific devices for specific applications. 


Comparisons are often made based on the matched 
impedance power of the two types of devices. These 
comparisons indicate that most ultracapacitors have 
a power capability (W/kg) of three to six times that of 
lithium batteries. However, for vehicle applications the 
matched impedance power is not appropriate and 
should not be the basis of comparison. 

For charge-sustaining HEV applications, a good 
basis of comparison is the W/kg at 95% efficiency 
at the SOC at which the devices will be used in the 
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Energy Storage: Ultracapacitor. Table 3 Comparisons of the power capabilities of various devices for HEVs and PHEVs 
using the different methods of calculation 


Device 

Lithium batteries Matched USABC Efficient pulse Efficient pulse 

60% SOC impedance Min/max EF =95% EF =80% 


Kokam NCM 30 Ah 

2,893 

2,502 

550 

1,848 

Enerdel HEV 

5,491 

4,750 

1,044 

3,507 

NCM 15 Ah 

Enerdel EV 

2,988 

2,584 

568 

1,908 

NCM 15 Ah 

EIG NCM 

2,688 

2,325 

511 

1,721 

20 Ah 

EIG FePhosph. 

2,141 

2,035 

458 

1,540 

15 Ah 

Altairnano LiTiO 

1,841 

1,750 

350 

1,180 

11 Ah 

Altairnano LiTiO 

4,613 

4,385 

992 

3,341 

3.8 Ah 

Ultracapacitors 





V 0 = 3/4V Rated 

Maxwell 2,890 F 

8,836 

4,413 

994 


Nesscap 3,100 F 

8,730 

4,360 

982 


BatScap 2,700 F 

18,224 

9,102 

2,050 


Yunasko 450 F 

22,838 

11,406 

2,569 


LS Cable 3,200 F 

12,446 

4,609 

1,038 


JSR 2,000 F 

9,228 

6,216 

1,400 



vehicle. On this basis, there are lithium batteries with 
the same power capability as some carbon/carbon 
ultracapacitors, but there are some ultracapacitors 
with power capability twice that of the highest power 
lithium batteries presently available for vehicle appli¬ 
cations. In other words, it is not possible to make 
general statements that are applicable to all devices of 
either type. The issue is further complicated when one 
notes that the density of the lithium batteries is about 
twice that of carbon/carbon ultracapacitors (2.2 g/cm 3 
for the batteries and 1.2 g/cm 3 for the ultracapacitors). 
Hence on a volume basis W/l at 95% efficiency, the 
differences between the batteries and ultracapacitors 


are often quite small. Hence for HEVs, batteries alone 
and ultracapacitors alone can be an option with the 
decision being based on cycle life and cost in addition 
to relative power capability [7]. 

For plug-in hybrid and battery electric vehicle 
applications, the maximum usable power density 
from the lithium-ion battery can be higher than in an 
HEV because the peak power of the driveline is used 
less frequently and consequently charge/discharge effi¬ 
ciently is less important. For example, a pulse power 
efficiency of 80% is probably sufficient and most of the 
lithium batteries have a power capability of greater 
than 1,000 W/kg, 2,200 W/l for that efficiency. 
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In addition, the battery is larger (heavier) in these 
vehicles and as a result, the power density requirement 
is less demanding. For PHEVs and EVs, the best appli¬ 
cation of ultracapacitors is likely to be in combination 
with batteries designed for high energy density, long 
cycle life, and low cost. In those cases, as discussed in 
a later section of the entry, the ultracapacitors greatly 
reduce the peak currents and dynamic stress on the 
batteries and thus extend their cycle life. 

Systems Considerations and Advantages of 
Combining Ultracapacitors and Batteries 

Vehicle Design Considerations 

The energy storage requirements vary a great deal 
depending on the type and size of the vehicle being 
designed and the characteristics of the electric 
powertrain to be used. Energy storage requirements 
for various vehicle designs and operating modes are 
shown in Table 4 for a midsize passenger car. Require¬ 
ments are given for electric vehicles and both charge- 
sustaining and plug-in hybrids. These requirements 
can be utilized to size the energy storage unit in the 
vehicles when the characteristics of the energy storage 
cells are known. In some of the vehicle designs consid¬ 
ered in Table 4, ultracapacitors are used to provide the 
peak power rather than batteries. 

In the cases using only ultracapacitors, the key issue 
is the minimum energy (Wh) required to operate the 


vehicle in real-world driving because the energy density 
characteristics of ultracapacitors are such that the 
power and cycle life requirements will be met in 
most cases if the unit is large enough to met the 
energy storage requirement. As shown in Table 4, for 
passenger car applications, the energy storage in the 
ultracapacitor can be 150 Wh or less even if the 
ultracapacitor is used alone for the energy storage. 

Reasons for Combining Ultracapacitors and 
Batteries 

Most hybrid vehicles use batteries for energy storage. 
The batteries must provide both the energy and power 
required by the vehicle. In all cases, this requires com¬ 
promises in designing the battery in that it cannot be 
optimized for either energy density or power capability. 
In addition, in most cases the battery is oversized 
(stores more energy than is used by the vehicle) in 
order to achieve long cycle life even for the shallow 
cycles experienced in charge-sustaining hybrid vehicles 
like the Toyota Prius. Combining ultracapacitors with 
batteries permits the design of the batteries to be opti¬ 
mized for energy density and cycle life and limits the 
magnitude and frequency of maximum power pulses 
experienced by the batteries during vehicle operation. 
The result will be a lighter, smaller, and lower cost 
battery which will have much longer cycle life (by as 
much as a factor of 2-3) and will operate over 


Energy Storage: Ultracapacitor. Table 4 Energy storage unit requirements for various types of electric drive midsize 
passenger cars 


Type of electric 
driveline 

System 
voltage V 

Usable 

energy storage 

Maximum pulse power at 
90-95% efficiency (kW) 

Cycle life (number 
of cycles) 

Usable depth- 
of-discharge 

Electric 

300-400 

15-30 kWh 

70-150 

2,000-3,000 

deep 






70-80% 

Plug-in hybrid 

300-400 

6-12 kWh battery 

50-70 

2,500-3,500 

deep 



100-150 Wh 



60-80% 



ultracapacitors 




Charge- 

150-200 

100-150 Wh 

25-35 

300 K-500 K 

Shallow 

sustaining hybrid 


ultracapacitors 



5-10% 

Micro-hybrid 

45 

30-50 Wh 

5-10 

300 K-500 K 

Shallow 



ultracapacitors 



5-10% 
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significantly narrower ranges of voltage and tempera¬ 
ture than would be the case using only a battery. In 
addition, the cooling requirements and complexity of 
the battery management unit of the battery will be 
reduced and be less costly. 

The ultracapacitors would provide the high-power 
pulses in discharge and accept all the charge pulses 
during regenerative braking of the vehicle. If needed, 
the ultracapacitors can be recharged at high rates 
during regenerative braking and/or using power from 
the engine or fuel cell. The ultracapacitors would be 
deep discharged (up to 75% of their rated energy) 
when that energy is needed to power the vehicle. Even 
for deep discharge operation, the ultracapacitors can be 
expected to have a cycle life of at least 500,000 cycles. 
As noted in Table 4, the energy stored (Wh) in the 
ultracapacitors can be quite limited being less than 
150 Wh for midsize passenger cars. Larger 
ultracapacitor units would be needed for buses and 
trucks, but their advantages would be similar to those 
used in passenger cars. The weight and cost of 
the ultracapacitor unit would be proportional to the 


energy stored so it is advantageous to limit the energy 
stored to the minimum required for a particular appli¬ 
cation. As shown in Fig. 1, for most applications, it will 
be necessary to utilize interface electronics to control 
the power flow from the ultracapacitors because their 
voltage changes significantly during discharge. This 
permits a large fraction of the usable energy of the 
ultracapacitors to be utilized and minimizes their cost. 

Sizing Components: Weight and Volume 

Whether or not a vehicle designer will consider using 
ultracapacitors alone or in combination with batteries 
instead of a battery pack for energy storage will depend 
to a large extent on the relative weight, volume, and 
cost of the two systems. 

Consider first the case of charge-sustaining hybrid 
vehicles (HEVs). In this case, the energy storage unit is 
sized by both usable power (kW) and energy storage 
(Wh) requirements. For batteries, the key issues are the 
power requirement and the minimum usable energy 
consistent with high cycle life for shallow cycles. 


E 



Energy Storage: Ultracapacitor. Figure 1 

Schematics of powertrains using ultracapacitors and batteries 
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The total energy stored in the battery unit is of second¬ 
ary importance as far as operation of the vehicle is 
concerned, but it has a strong effect on the weight, 
volume, and cost of the unit. For ultracapacitors, the 
key issue is the minimum energy (Wh) required to 
operate the vehicle in real-world driving because the 
energy density characteristics of ultracapacitors are 
such that the power and cycle life requirements will 
be met if the unit is large enough to meet the energy 
requirement. 

Energy storage units for charge-sustaining hybrid 
powertrains for midsize passenger cars are shown 
in Table 5. For each application, the usable power 
and energy, total energy stored, and incremental 


state-of-charge for the unit and its weight and volume 
are shown in the table. The results shown in Table 5 
indicate that for charge-sustaining hybrids, either bat¬ 
teries or ultracapacitors can be used alone. All the units 
listed should function satisfactorily for the applications 
indicated with the decision concerning their relative 
attractiveness based to a large extent on cost, which at 
the present time is uncertain for all the systems. The 
results in Table 5 also indicate that a combination of 
lead-acid batteries and ultracapacitors would result in 
a relatively low weight unit for a micro-hybrid having 
a maximum electric motor power of 5-10 kW. The 
economics of such a combination should be attractive 
because of the low cost of the lead-acid batteries. 


Energy Storage: Ultracapacitor. Table 5 Characteristics of various technologies/types of batteries for use in charge- 
sustaining hybrid vehicle applications 


Charge-sustaining hybrids (HEVs) 

Technology 

Cor Ah 

System 

voltage 

A SOC% 

Total 

energy Wh a 

Max. 

Power kW b 

Pulse 

efficiency% 

Weight of 
cells kg 

Volume of 
cells 1 

Ultracapacitors 

Carbon/carbon 

2,400 

160 

75 

100 

>40 

90 

22 

15 

microporous C/ 
graphitic C 

2,400 

160 

75 

150 

27 

90 

18 

13 

Lithium-ion batteries 

Graphite/Ni Co Al 

9 

160 

6.6 

1,500 

31 

91 

24 

11 

Iron phosphate 

2.2 Ah cells 

11 

160 

5.5 

2,000 

27 

91 

23 

11 

Lithium titanate 

11 Ah 

11 

160 

5.5 

1,800 

27 

90 

26 

14 

Lithium titanate 

2.5 Ah 

4 

160 

16.6 

600 

27 

90 

16 

9 

Nickel metal 
hydride (Prius) 





14 

90 



10 

160 

6.25 

1,600 

25 

80 

35 

19 

Micro-hybrids 

Lead-acid 

50 

45 

5 

1,800 

5 

90 

53 

19 

Lead-acid/ 

ultracapacitors 

28 


10 

1,000 



29 

10 

4,000 

45 

75 

50 

15 

95 

10 

8 







39 total 

18 total 

Nickel metal 
hydride (Prius) 

24 

45 

10 

1,100 

10 

90 

24 

13 


a Total energy stored 

b Power at the pulse efficiency indicated 
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Next consider energy storage units for plug-in 
hybrid vehicles (PHEVs). A key design parameter for 
PHEVs is the all-electric range. Energy storage units 
will be considered for all-electric ranges of 10, 15, 20, 
30, and 40 miles. The acceleration performance of all 
the vehicles will be the same (0-60 mph in 8-9 s). For 
the batteries, the usable depth of discharge will be taken 
to be 70%. For the ultracapacitors, it is assumed that 
they can be cycled from rated to one-half rated voltage 
using 75% of the total stored energy. In the PHEVs, the 
battery will be sized by the energy needed to sustain 
a specified all-electric range. Hence the weight and 
volume of the battery pack follow directly from its 
energy density (Wh/kg, Wh/1). This means that battery 
technologies with high energy density will be strongly 
favored for PHEV applications. However, the batteries 
must also be able to meet the power requirements of 
the PHEV powertrain which can approach those of the 
battery-powered EV. Unfortunately, battery designs to 
attain the maximum energy density in most cases 
require a sacrifice in power capability as shown in 
Table 1. Calculated power density requirements for 
PHEVs of various ranges and maximum power [8] 
are given in Table 6 for batteries having energy densities 
of 70, 100, and 200 Wh/kg. The values shown are for 
a midsize passenger car having an energy usage of 
250 Wh/mile in the all-electric mode and maximum 
electric motor power of 50 and 70 kW. 

The required power densities for all-electric ranges 
of 20 miles and less are high especially for high energy 
density batteries. When the power density required 


exceeds that of the battery, it is of interest to consider 
using an ultracapacitor to augment the power of the 
battery. The usable pulse power from a battery depends 
on the efficiency that is acceptable for a particular 
application. The power densities for efficiencies of 
95% and 80% for the various battery technologies are 
shown in Table 3. If it is assumed that 80% efficiency is 
satisfactory for use of the battery in the charge- 
depleting mode of the PHEV, it can be concluded 
from Tables 3 that ultracapacitors would be needed by 
the 200 Wh/kg lithium-ion and the 70 Wh/kg nickel 
metal hydride batteries. This result is shown in Table 7. 
The energy storage unit weights for the PHEVs of 
various all-electric ranges and battery energy density 
are given in Table 8. The combination of 
ultracapacitors and 200 Wh/kg lithium-ion batteries 
have the lowest weight for all the ranges. For higher 
energy density batteries (>250 Wh/kg), it seems even 
more likely that ultracapacitors will be needed to meet 
the power demands of PHEVs. The combination of 
ultracapacitors and batteries will also yield the highest 
efficiency during the charge-sustaining mode of vehicle 
operation after the energy of the batteries has been 
depleted. 

As noted above, ultracapacitors can be used with 
70 Wh/kg nickel metal hydride batteries for PHEV 
applications due to their limited power capability. 
Even using ultracapacitors, the all-electric range with 
nickel metal hydride batteries is limited to about 
15 miles as otherwise the weight of the energy storage 
unit would exceed 100 kg. 
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Energy Storage: Ultracapacitor. Table 6 Battery sizing and power density for various ranges and motor power 


Range 

miles 

kWh 

a needed 

kWh b 

stored 

Battery 200 Wh/kg 

Battery 100 Wh/kg 

Battery 70 Wh/kg 

b kg 

50 kW 
kW/kg 

70 kW 
kW/kg 

kg 

50 kW 
kW/kg 

70 kW 
kW/kg 

kg 

50 kW 
kW/kg 

70 kW 
kW/kg 

10 

2.52 

3.6 

18 

2.78 

3.89 

36 

1.39 

1.94 

51 

0.98 

1.37 

15 

3.78 

5.4 

27 

1.85 

2.59 

54 

0.92 

1.30 

77 

0.65 

0.91 

20 

5.04 

7.2 

36 

1.39 

1.94 

72 

0.69 

0.97 

103 

0.49 

0.68 

30 

7.56 

10.8 

54 

0.93 

1.30 

108 

0.46 

0.65 

154 

0.32 

0.46 

40 

10.1 

14.4 

72 

0.69 

0.97 

144 

0.35 

0.49 

206 

0.24 

0.34 


a Vehicle energy usage from the battery: 250 Wh/mi 

b Usable state-of-charge for batteries: 70%, weights shown are for cells only 
























3480 


E 


Energy Storage: Ultracapacitor 


Energy Storage: Ultracapacitor. Table 7 Battery/ultracapacitor combinations for various all-electric ranges 


Wh/kg 

100 

200 

100 

90 

70 

70 

Range miles 

GAIA 

Quallion 200 

Quallion 100 

A123 

Altairnano 

Panasonic Nickel mt.hyd. 

10 

N/N a 

Y/Y 

N/N 

N/N 

N/N 

Y/Y 

15 

N/N 

Y/Y 

N/N 

N/N 

N/N 

Y/Y 

20 

N/N 

Y/Y 

N/N 

N/N 

N/N 

Y/Y 

30 

N/N 

Y/Y 

N/N 

N/N 

N/N 

Y/Y 

40 

N/N 

N/N 

N/N 

N/N 

N/N 

Y/Y 


a X/X means need at 50 kW/need at 70 kW yes Y or no N 


Energy Storage: Ultracapacitor. Table 8 Storage unit weights using a combination of batteries and ultracapacitors for 
various all-electric ranges 


Wh/kg 

5 

200 


100 


70 


Range 

miles 

Ultracap kg a 

Battery Kg b 

Combination kg 

Battery kg 

Combination kg 

Battery kg 

Combination kg 

10 

20 

18 

38 

36 

56 

51 

71 

15 

20 

27 

47 

54 

74 

77 

97 

20 

20 

36 

56 

72 

92 

103 

123 

30 

20 

54 

74 

108 

128 

154 

174 

40 

20 

72 

92 

144 

164 

206 

226 


a The ultracapacitor unit stores 100 Wh usable energy 

b Weights shown are for cells only, packaging into modules not Included 


Power Split Strategies 

As noted previously, interface electronics is needed 
(Fig. 1) in hybrid powertains using a combination of 
batteries and ultracapacitors to control the power split 
between the two energy storage units. Selection of 
a control strategy that will both limit the maximum 
power demanded of the batteries and improve the effi¬ 
ciency of the powertrain is critical to the design of 
attractive HEVs and PHEVs. One approach for the 
PHEV applications is to have the battery meet the aver¬ 
age vehicle power requirement and the ultracapacitor 
meet the variable, pulse power requirement. The average 
vehicle power demand can be calculated over some 
specified time period (30-120 s) depending on the driv¬ 
ing conditions. As discussed later, vehicle simulations 
indicate that this approach results in relatively slow 


variations in battery power and should lead to signifi¬ 
cantly increased battery cycle life and reduced need for 
high-power cells. Using this approach, the interface elec¬ 
tronics could be directly attached to either the batteries 
or ultracapacitors (see Fig. 1). 

As in the case of the HEVs, the key element of the 
system control strategy is how the ultracapacitors are 
recharged utilizing engine power via the motor/gen- 
erator. The vehicle simulations have shown that the 
ultracaps should be recharged during periods in which 
the engine is needed to meet the vehicle power 
requirements. In this way, the engine operates nearly 
all the time at high efficiency and not at low output 
torque conditions resulting in large improvements 
(>35%) in fuel economy compared to the reference 
engine alone case. 
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Laboratory and Vehicle Field Experiences Using 
Ultracapacitors 

Laboratory Testing of Ultracap/Battery 
Combinations 

Most of the research [9-11] involving the testing of 
ultracap/battery combinations has been performed on 
bench-top arrangements. In most cases, these tests have 
included state-of-the-art interface electronics that con¬ 
trolled the currents from the ultracapacitors. The intent 
of these studies has been to show that the stress on the 
batteries due to repeated high current pulses was 
greatly reduced using ultracapacitors resulting in 
extended battery cycle life. The test data show clearly 
that the mean currents from the batteries are greatly 
reduced, but the effect on battery cycle life is not yet 
clear from the data. Much further testing of this type, 
especially using lithium batteries, is needed. The 
test results [12] from a simpler test utilizing small 


spiral wound lithium batteries (1.4 Ah) and 310 
F ultracapacitors without interface electronics are 
shown in Fig. 2. The data show the effect of load 
leveling on the cycle life of the cells which were pulsed 
at the 5C rate. The magnitude of the effect (a factor of 
7-8) is likely much greater than can be expected with 
larger batteries, but nevertheless, the data do indicate 
that load-leveling batteries can be expected to signifi¬ 
cantly improve cycle life. 

Vehicle Field Experience Using Ultracapacitors 

There has been continuing experience in demonstrat¬ 
ing the use of ultracapacitors in hybrid-electric 
powertrains in heavy-duty [13-15] and light-duty 
[16-18] vehicles. In these applications, the capacitors 
provide the peak power for acceleration and recover 
energy during periods of braking. An engine generator 
or fuel cell is used to generate onboard electricity to 
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Pulsed discharge at average rate of 1 C comprising 5 C 5 s pulses at a 20% duty factor. Each cycle 80% (1.12 Ah) 
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recharge the capacitors as needed. The control strategy 
for the hybrid vehicle should be designed to operate the 
engine or fuel cell as close to the maximum efficiency 
condition as the driving cycle permits. Capacitor units 
for use in hybrid vehicles must be high voltage as the 
peak power for these applications is high (up to 
200 kW). For passenger cars and SUVs, the voltage is 
likely to be about 300 V and for transit buses about 
700 V. This would require 125 cells in series for the 
light-duty vehicles and 250 cells in series for the buses. 

Transit Bus Applications In the USA, the company 
most active in utilizing ultracapacitors in hybrid- 
electric powertrains for buses and large trucks has 
been the ISE Corporation in San Diego, California 
[13]. ISE has developed a 360 V capacitor unit 
consisting of 144 2,600 F Maxwell cells connected in 
series (see Fig. 3). The weight and volume are 114 kg 
and 189 1, respectively. The unit stores 0.325 kWh of 
energy (0.245 kWh usable). In a transit bus, two of the 
units are used in series resulting in a voltage of 720 V 
and energy storage of 0.650 kWh. The peak power 



capability of the combined unit is over 300 kW. ISE 
utilizes this capacitor unit with a 225 kW electric motor 
in series hybrids using gasoline and diesel engines and 
hydrogen fuel cells. Since the capacitor unit stores only 
about 0.5 kWh, it can provide power only during vehi¬ 
cle acceleration and recovers energy during braking. 
The engine or fuel cell must provide all the power 
during cruise and hill climbing. ISE has built over 100 
buses using the ultracapacitor energy storage units for 
transit companies in Southern California. The buses 
are in daily revenue service. Field experience has shown 
that the capacitor unit functions as planned with a high 
reliability. Field operation of the buses has shown a fuel 
economy improvement of 25% compared to the stan¬ 
dard diesel engine buses [13]. 

Hybrid-Electric Passenger Car Applications Unlike 
the transit bus applications described in the previous 
section, there has not been a commercialized use of 
ultracapacitors in passenger cars as of 2010. In the last 
couple of years, there has been considerable discussion 
[19, 20] of marketing of micro-hybrids with stop-start 



Nominal voltage 

360V* 

Peak voltage 

403V** 

Rated current 

400A 

Capacitance 

18.05F 

Total energy stored nominal/max. 

0.325k Wh/0.407kWh 

Leakage current 

5mA nominal 

Operating temperature 

-35 to 65°C 

Weight 

240lbs 

Dimensions wxlxh 

24"x40"x12" 

Standard pack 

144 capacitors 

Fire suppression system 

Heat activated halotron system 
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The ISE ultracapacitor unit (360 V, 0.325 kWh) 
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capability to improve fuel economy particularly in 
Europe. It is likely that when these cars are finally 
marketed some of them will use ultracapacitors. 
There have been several noteworthy demonstrations 
of ultracapacitors in hybrid-electric passenger cars. As 
discussed in the following paragraphs, these projects 
were very successful and showed that ultracapacitors 
can be used in place of batteries in mild hybrid vehicles 
with fuel economy improvements at least as high as 
with batteries. 

The first project was termed “SUPERCAR” and was 
funded by the European Community (EC). It was 
a joint project between EPCOS [16], the ultracapacitor 
developer, and Siemens VDO [21], the vehicle integra¬ 
tor. The parallel hybrid passenger car (VW Golf) com¬ 
bined the ultracap module and lead-acid battery into 
a 42 V, 10 kW (peak) electric driveline with a 66 kW 
engine. The vehicle was tested on both the chassis 
dynamometer and the road. The tests showed a 
16-18% improvement in fuel economy compared to 
the standard ICE car. The capacitor unit consisted of 24 
3,600 F cells connected in series (see Table 2 for the cell 
characteristics). The rated voltage of the unit is 60 V 
and its capacitance and resistance are 150 F and 
8 mOhm, respectively. The unit is shown in Fig. 4. 

A second, more recent project (2009), is described 
in detail in [17]. This project was performed by the 
National Renewable Energy Laboratory (NREL) and 
General Motors (GM). It involved installing 
ultracapacitors in a 2007 Saturn Vue hybrid along 
with the nickel metal hydride battery. The vehicle 
could be operated on either of the energy storage 
units alone in order to make a direct comparison of 


its operation on the ultracapacitors and battery. The 
ultracapacitor energy storage unit consisted of one or 
two 48 V, 165 F modules from Maxwell. Each module, 
which consisted of 18 3,000 F cells connected in series 
(see Table 2 for the characteristics of the cells), stored 
about 35 Wh. A special UCAP state estimator was 
utilized to maintain the ultracapacitors in the required 
range of state-of-charge. 

The 2007 Saturn Vue hybrid is a mild hybrid using 
a belt alternator/starter in parallel with a 127 kW, var¬ 
iable valve timing gasoline engine. The alternator/ 
starter electric machine has a continuous rating of 
3 kW and a peak power of 6-8 kW. The nickel metal 
hydride battery has a nominal rating of 36 V, but 
vehicle testing indicated it operates in the 38-42 V 
range most of the time at 75% state-of-charge. The 
characteristics of the energy storage units are compared 
in Table 9. 

Extensive dynamometer testing of the vehicle was 
done operating on the ultracapacitors (both with one 
module and two modules) and the nickel metal hydride 
battery. The vehicle was tested on several driving cycles 
(FUDS, FHWY, and US06). The three energy storage 
options could be utilized interchangeably using the 
same vehicle operating strategy. The vehicle operated 
essentially the same on each of the energy storage 
options. The testing indicated that only 25-30 Wh 
was used from the ultracapacitor units even on 
the US06 driving cycle. When using only one 
ultracap module on the FUDS cycle, the state-of-charge 
(V/V ra ted) of the ultracap unit varied from 75-90%. 
There was no difference in the measured fuel economy 
when the vehicle was operated on the ultracapacitors or 
the nickel metal hydride battery for any of the three 


E 



Energy Storage: Ultracapacitor. Figure 4 

The EPCOS 60 V ECC module (150 F, 56 Wh) 


Energy Storage: Ultracapacitor. Table 9 Characteristics 
of the energy storage units used in the Saturn Vue test 
vehicle 


Storage unit 

Weight 

(kg) 

Volume 

(1) 

Energy 

stored 

Nickel metal hydride 
battery 

24.7 

15.4 

1.1 kWh 
(estimated) 

Ultracapacitors 




One 48 V module 

14.8 

11.2 

35 Wh 

Two 48 V modules 

29.6 

22.4 

70 Wh 
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driving cycles. Test track acceleration tests of the 
vehicle indicated no differences in the acceleration 
times for 0-60 or 40-60 mph using the one module 
ultracapacitor and nickel metal hydride battery units. 

Fuel Cell Vehicle Applications Ultracapacitors can 
also be used to meet the peak power transients in 
vehicles powered by fuel cells [22, 23]. The most pub¬ 
licized use of ultracapacitors in fuel cell vehicles has 
been that by Honda [22] in the “Honda FCV.” In the 
Honda FCV, the capacitors were used to load level the 
fuel cell during accelerations and to recovery energy 
during regenerative braking. In the Honda fuel cell 
system, the ultracapacitors are used without interface 
electronics. The capacitors functioned very well and 
resulted in significant improvements in system energy 
efficiency. 

A detailed study of the use of ultracapacitors in fuel 
cell-powered vehicles is given in [24]. The study 
showed that ultracapacitors are well suited to be used 
with fuel cells. This is especially true when the fuel cell 
and ultracapacitors are connected in parallel because 
the effective rest voltage of the capacitor is variable and 
their high-power capability permits them to meet the 
maximum power requirement of the system at all volt¬ 
ages between the rest voltage and the minimum voltage 
at which the fuel cell is to be operated. In this mode of 
operation, the system is self-controlling with the 
ultracapacitor and fuel cell sharing the power demand 
based on the system voltage of operation. Tests [24] 
were performed on a system (Fig. 5) consisting of the 
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Photograph of the ultracapacitor/fuel cell test setup 


Ballard 1.2 kW Nexa fuel cell and an ultracapacitor 
bank made up of 17 Ness 2,500 F capacitors. The 
system was tested for 10 s pulses up to 5 kW. An 
analytical model of the fuel cell/capacitor system was 
derived and programmed using an EXCEL spreadsheet. 
Comparisons of the measured and modeled response 
of the system at various power levels (Fig. 6) indicated 
that the system operated as modeled and that the self- 
control of the fuel cell was as expected. SIMPLEV 
simulations [24] on the FUDS driving cycle were 
also performed for a midsize SUV powered by 
ultracapacitors connected in parallel with a fuel cell. It 
was found that the ultracapacitors significantly load 
leveled the fuel cell and the behavior of the system 
was much like that measured with the Ballard 1.2 kW 
Nexa fuel cell. 

Hybrid Vehicle Simulations Using Ultracapacitors 
Alone 

Powertrain Control Strategies with Ultracapacitors 
Alone in Place of Batteries 

Schematics of hybrid powertrains using ultracapacitors 
and batteries are shown in Fig. 1. If an ultracapacitor 
unit is used alone, there is no need for special electron¬ 
ics if the motor electronics can handle the increased 
voltage swing of the ultracapacitors. If that is not the 
case, then interface electronics is needed between the 
ultracapacitors and the motor electronics. 

When ultracapacitors are used alone as the energy 
storage unit in a charge-sustaining hybrid (HEV), the 
objective of the control strategy is to permit the engine 
to operate near its maximum efficiency. As shown in 
[25-27], this can be done by operating the hybrid 
vehicle on the electric drive only when the power 
demand is less than the power capability of the electric 
motor; when the vehicle power demand exceeds that of 
the electric motor, the engine is operated to meet the 
vehicle power demand plus the power to recharge the 
ultracapacitor unit. In this mode, the electric machine 
is used as a generator and the engine operating point is 
selected along its maximum efficiency line (torque vs. 
RPM). The recharging power is limited by the power of 
the electric machine because ultracapacitors can have 
pulse power efficiency greater than 95% for W/kg 
values of over 2,000 W/kg (see Table 3). This control 
strategy is referred to as the “sawtooth” strategy 
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Energy Storage: Ultracapacitor. Figure 6 

Comparison of the measured and modeled ultracapacitor and fuel cell powers for the 5 kW pulse power test 


because a plot of the ultracapacitor state-of-charge 
(SOC) has the form of a saw blade. It can also be 
referred to as the “high engine efficiency” strategy. 

Vehicle Simulation Results 

Simulations of midsize passenger cars using the 
ultracapacitors in micro-hybrid and charge-sustaining 
hybrid powertrain designs have been performed using 
the Advisor vehicle simulation program modified with 
special routines at UC Davis [25-27]. All the 
powertrains were in the same vehicle having the fol¬ 
lowing characteristics: test weight 1,660 kg, C d =0.3, 
Ap = 2.25 m 2 , RRCF = 0.009. The engine map used in 
the simulations was for a Ford Focus 2 1, 4-cylinder 
engine. The rated engine power was 120 kW for the 
conventional ICE vehicle and the micro-hybrid 
and 110 kW for the charge-sustaining hybrids. All the 


hybrids use the single-shaft arrangement similar to 
the Honda Civic hybrid. The same induction electric 
motor map was used for all the vehicle designs. 

The simulation results are summarized in Table 10 
for a conventional ICE vehicle and each of the hybrid 
designs. Results are given for fuel usage in terms of both 
1/100 km and mpg for various driving cycles. It is clear 
from Table 10 that large improvements in fuel usage 
are predicted for all the hybrid powertrains using 
ultracapacitors for energy storage. The simulation 
results will be discussed separately for each hybrid 
design. 

Micro-hybrids The results for the micro-hybrids are 
particularly interesting and somewhat surprising, 
because of the large fuel economy improvements 
predicted. These improvements were about 40% on 
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Energy Storage: Ultracapacitor. Table 10 Summary of the vehicle fuel economy simulation results using ultracapacitors 
for various driving cycles 


Driveline Voltage and L/IOOkm/mpg 


type Energy storage type weight cells (kg) EM Peak kW FUDS HWFET US06 ECE-EUDC 


ICE baseline 




10/23.8 

6.9/34.4 

9.6/24.7 

9.7/24.6 

Micro-HEV 

Lead-acid/ultracaps 

48 






Carbon/carbon 

6 kg 

6 

5.7/41.7 

5.3/44.7 

7.8/30.6 

5.9/40.2 

Hybrid carbon 

3 kg 

6 

7.3/32.8 

6.3/38.0 

8.9/26.7 

7.1/33.4 

Charge- 

sustaining 

hybrid 

Ultracaps 

200 






Carbon/carbon 

30 kg 

35 

5.4/43.8 

5.0/47.9 

7.1/33.6 

5.5/43.2 

Hybrid carbon 

13 kg 

35 

5.8/40.9 

5.2/45.8 

8.0/29.9 

5.8/41.3 

Plug-in 

hybrid. 

12 kWh Li battery 
(200 Wh/kg) and 
ultracaps 

300 

70 kW with 

45 kW from 

caps 





Carbon/carbon 

40 kg 

45 

5.5/43.2 

5.0/47.7 

7.0/33.9 

5.5/42.9 

Hybrid carbon 

18 kg 

45 

5.8/41.2 

5.2/46.2 

7.9/30.2 

5.8/41.2 


the FUDS and ECE-EUD cycles and 20% on the Federal 
Highway and US06 cycles using the carbon/carbon 
ultracapacitors. The improvements are significantly 
less using the advanced hybrid carbon units because 
of their lower round-trip efficiencies. In the micro¬ 
hybrid designs, the rated engine power used was the 
same as that in the conventional ICE vehicle in order 
that the performance of the hybrid vehicle when the 
energy storage in the ultracapacitors is depleted would 
be the same as the conventional vehicle. The 
ultracapacitors were used to improve fuel economy 
with only a minimal change in vehicle acceleration 
performance. The control strategy used was the “saw¬ 
tooth” strategy discussed in the previous paragraph. As 
shown in Fig. 7, this resulted in a large improvement in 
average engine efficiency from 19% in the ICE vehicle 
to 30% in the micro-hybrid even though the electric 
motor had a peak power of only 6 kW. 

Additional computer simulations were made for 
higher motor power (up to 12 kW) and larger 
ultracapacitor energy storage (up to 50 Wh). It was 
found that the improvements in fuel economy were 
only marginally greater. However, using a motor 
power of 3 kW reduced the fuel economy improvement 
on the FUDS by more than 50%. Note from Table 10 


that the fuel economy improvements using the carbon/ 
carbon ultracapacitors were for all the cycles greater 
than those using the advanced hybrid carbon devices. 
This was the case because the round-trip efficiencies for 
the carbon/carbon units were 95-98% and those of the 
hybrid carbon units were 75-90% for the various driv¬ 
ing cycles. As shown in Table 2, the advanced hybrid 
carbon devices had higher energy density, but even 
though their power density for 95% efficiency was 
relatively high (1,050 W/kg), it was not proportionally 
higher - that is twice as high - as the carbon/carbon 
devices with lower energy density. These results show 
that it is essentially to develop high energy density 
ultracapacitors with proportionally higher power den¬ 
sity; otherwise their use in vehicle applications will be 
compromised. 

Charge-Sustaining Mild/Strong Hybrids The fuel 
economy simulation results for mild/strong charge- 
sustaining hybrids are also shown in Table 10 for a mid¬ 
size passenger car using both carbon/carbon and 
advanced hybrid carbon ultracapacitors. Using the car¬ 
bon/carbon ultracapacitor unit, the fuel savings are 
about 45% for the FUDS and ECE-EUD cycles and 
about 27% for the Federal Highway and US06 cycles. 























Energy Storage: Ultracapacitor 


E 


3487 




Engine operating efficiency for the ICE vehicle- Engine operating efficiency for the micro-hybrid- 

average engine efficiency .19 average engine efficiency .30 


E 


Energy Storage: Ultracapacitor. Figure 7 

A comparison of engine efficiencies for a conventional ICE vehicle and a micro-hybrid on the FUDS cycle using carbon/ 
carbon ultracapacitor 


These improvement values are higher than for the 
micro-hybrid, but not by as large a factor as might be 
expected. The prime advantage of the high-power elec¬ 
tric driveline in the mild/strong charge-sustaining 
hybrid is that it yields large fuel economy improve¬ 
ments even for high-power-requirement driving cycles 
like the US06. The fuel economy improvements using 
the advanced hybrid carbon ultracapacitor unit are not 
much less (5-10%) than those with the carbon/carbon 
unit even though the round-trip efficiency of the 
hybrid carbon unit is only 85-90% compared to 98% 
for the carbon/carbon unit. Since the weight/volume of 
the hybrid carbon unit is relatively small - 43% of that 
of the carbon/carbon unit, it appears that the mild/ 
strong charge-sustaining hybrid application is a better 
one for the advanced hybrid carbon technology than 
the micro-hybrid application. 

Plug-in Hybrid Vehicle Simulations Using 
Advanced Batteries and Ultracapacitors 

Control Strategy for Plug-in Hybrid Vehicles 
(PHEVs) 

For plug-in hybrid vehicles, batteries and 
ultracapacitors would be used in combination with 


the batteries providing the energy and the 
ultracapacitors the power. The powertrain schematics 
are shown in Fig. 1. The control strategy in the charge- 
depleting mode is to limit the power from the battery 
to the average power needed by the vehicle with the 
ultracapacitor providing the additional power during 
vehicle accelerations. The ultracapacitors also accept all 
the energy recovered during regenerative braking. If 
engine operation is needed, the “sawtooth” strategy is 
used with the ultracapacitors being recharged using 
engine power. In the charge-sustaining mode of oper¬ 
ation of the PHEV, the electric drive is operated using 
only the ultracapacitors for energy storage like that 
previously described for mild/strong charge-sustaining 
hybrid vehicles. The PHEV operating strategies are 
summarized in Fig. 8. 

Simulations of Plug-in Hybrid Vehicles Using 
Advanced Batteries and Ultracapacitors 

A detailed study of plug-in hybrids using advanced 
batteries is presented in [8]. The characteristics of the 
advanced batteries used in the simulations are given in 
Table 11. 

The battery and ultracapacitor units used in the 
simulations are given in Table 12. Simulations were 
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Low battery SOC & 
Low ultracap SOC 
OR 

High power request 
(> max. motor power) 
OR 

Low ultracap SOC & 
High power demand 
(> max. battery power) 


High ultracap SOC 
(> ultracap mean SOC) 
& 

Low power request 
(< max. motor power) 



HEV mode 

Engine optimum 
operation 



High power request 
(> opt. engine power) 
& 

Low battery SOC & 
Low ultracap SOC 


Low power request 
(< opt. engine power) 
& 

High ultracap SOC 
(> ultracap mean SOC) 



HEV mode 

Engine normal 
operation 



Energy Storage: Ultracapacitor. Figure 8 

Summary of the operating strategy used in the simulation of plug-in hybrid vehicles 


Energy Storage: Ultracapacitor. Table 11 Characteristics of present and future battery cell technologies for EVs and 
PHEVs 


Chemistry Anode/ 
cathode 

Cell voltage 
Max/nom. 

Ah 

Wgt. kg 

R mOhm 

EV Wh/ 
kg 

HEV W/ 
kg 95% 

EV W/ 
kg 75% 

Cycle life 
(deep) 

Thermal 

stability 

Present technology batteries 

Graphite/NiCoMn0 2 

4.2/3.6 

30 

0.787 

1.5 

140 

521 

2,060 

2,000- 

3,000 

fairly stable 

Graphite/Mn spinel 

4.0/3.6 

15 

0.424 

2.7 

127 

540 

2,120 

1,500 

fairly stable 

Future technology batteries 

Graphite/composite 

Mn0 2 

4/3.6 

5 

0.09 

20 

200 

250 

1,350 

- 

fairly stable 

Silicon carbon 
composites/ 
composite Mn0 2 

4/3.6 

20 

0.24 

4.5 

295 

621 

2,250 


fairly stable 

Rechargeable zinc-air 

1.3/1.15 

20 

60 

6.6 

385 

156 

616 

- 

very stable 

Present technology power devices 

supercapacitor 
activated carbon/ 
activated carbon 

2.7/1.35 

500 F 

0.068 

1.3 

5.5 

2,320 

11,600 

500 K 

Very stable 

Power battery lithium 
titanate oxide 

2.8/2.5 

4 

0.23 

1.15 

40 

1,310 

5,170 

20-50 K 

Very stable 


performed with the batteries alone and in combination 
with the ultracapacitors. The nominal energy storage 
unit voltage was 240 V in all cases with the maximum 
currents limited to about 300 A even in the cases of the 
batteries alone. In all cases, the batteries were depleted 
to 30% SOC from 100% SOC in the charge-depleting 
mode of vehicle operation. 


Simulation Results for the PHEVs All the PHEV 
simulations were performed using the following vehicle 
inputs: 

C D = 0.27, A F = 2.2 m 2 , f r = 0.008, test weight = 1,650 kg 
(approx.) 

Engine: Honda 1.3 1, iVTEC engine map, scaled to 
90 kW 
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Energy Storage: Ultracapacitor. Table 12 The advanced 
battery and ultracapacitor units used in the simulations 


Lithium-ion batteries 

EIG 

NiCoMn0 2 

30 kg 

20 Ah 
cells 

60 in 

series 

3.5 kWh 
usable 

Composite 

Mn0 2 

32 kg 

40 Ah 
cells 

60 in 
series 

4.6 kWh 
usable 

Silicone 

composite 

22 kg 

30 Ah 
cells 

60 in 
series 

4.6 kWh 
usable 

Zinc-air batteries 

Zn-air 

32 kg 

60 Ah 
cells 

180 in 

series 

9.5 kWh 
usable 

Carbon/carbon ultracapacitors 

Symmetric 

C/C 

20 kg 

1,350 F 
cells 

110 in 

series 

100 Wh 
usable 


Electric motor: Honda hybrid Civic AC PM 2006 effi¬ 
ciency map, scaled to 70 kW 
DC/DC inverter: constant efficiency 0.96 
Transmission: 5-speed manual (3.11, 2.11, 1.55, 1.0, 
0.71, FD = 3.95), automatically shifted in the 
model, but future models would incorporate the 
DCT (dual clutch transmission) as a convenient 
means to have smooth, fast shifting and to decouple 
the engine when it is not needed to provide power 

Fuel and Electricity Use Characteristics The simu¬ 
lation results are summarized in Tables 13 and 14 
showing results for vehicle operation in the charge- 
depleting and charge-sustaining modes of the PHEV. 

With the batteries in combination with the 
ultracapacitors, the PHEVs were able to operate in 
the all-electric mode down to the battery SOC = 30% 
on the FUDS and FDHW driving cycles. In all cases for 
the US06 driving cycle, the vehicle had blended oper¬ 
ation (engine and electric drive both needed) in the 
charge-depleting mode. The use of the ultracapacitors 
with the batteries permitted all-electric operation over 
a wide range of driving conditions with higher Wh/mi 
for all the driving cycles. Hence in the charge- 
depleting mode, the fuel economy (mpg) is higher by 
50-100% using the ultracapacitors for all the batteries. 
The fuel economy in the charge-sustaining mode is 
also higher for all the driving cycles using the 
ultracapacitors by 15-40%. The acceleration times of 


the vehicle were lower using the ultracapacitors than 
for the batteries alone. With the ultracapacitors, the 
acceleration times were 2.7 s for 0-30 mph and 6.9 s 
for 0-60 mph. For the batteries alone, the acceleration 
times varied somewhat with the battery used ranging 
from 2.9 to 3.2 s for 0-30 mph and from 8.6 to 9.8 s for 
0-60 mph. Hence in all respects, vehicle performance 
was improved using the ultracapacitors for all the 
batteries studied. 

Reductions in Battery Stress Using Ultracapacitors 

The current and voltage responses of the batteries with 
and without the ultracapacitors are shown in Figs. 9-12 
for the silicone carbon lithium-ion and the zinc-air 
batteries for the FUDS and US06 driving cycles. The 
effects of the load leveling of the power demand from 
the batteries using the ultracapacitors are evident in the 
figures. Both the average currents and the peak currents 
from the batteries are lower by a factor of 2-3 using the 
ultracapacitors. The minimum voltages of the batteries 
are significantly higher using the capacitors and the 
voltage dynamics (fluctuations) are dramatically 
reduced. Hence, the stress on the battery and resultant 
heating are much reduced. The simulation results in 
Figs. 9-12 also show that the ultracapacitors are uti¬ 
lized over a wide voltage range indicating that a large 
fraction of their usable energy storage (100 Wh) is 
being used to load level the batteries. This is only 
possible using a DC/DC converter between the battery 
and the DC bus. 

The simulation results also indicate that using 
ultracapacitors, batteries with a wide range of power 
characteristics can be used in PHEVs and also EVs 
without sacrificing vehicle performance and subjecting 
the batteries to high stress and resultant shorter cycle 
life. This could be especially important in the future as 
high energy density batteries such as zinc-air and pos¬ 
sibly lithium-air are developed. It is likely that those 
battery types will not have commensurate increases in 
usable power density and without ultracapacitors, 
the battery unit in PHEVs and EVs would be sized 
by the maximum power requirement (kW) rather 
than the range (mi)/energy requirement (kWh). This 
would significantly increase weight, volume, and 
the cost of the battery unit. It is also unlikely that the 
air electrode will have high charge acceptance 
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Energy Storage: Ultracapacitor. Table 13 Simulation results for the advanced batteries with ultracapacitors 



Cycle 

Range 

mi. 

kW max. 
control 

kW 

max bat. 

Eff. 

Bat. 

kW 

max. Cap. 

Eff. 

Cap. 

Wh/ 
mi Bat. 

Operat. 

mode 

mpg 

20mi 

mpg 

40mi 

Ch. Sus. 
HEV mpg 

Compos. 

Mn02 

FUD 

22 

40 

18 

0.94 

40 

0.97 

215 

AE 

none 

97 

52.8 

32 kgbat 

HW 

20 

45 

18 

0.91 

45 

0.96 

227 

AE 

none 

109 

56.3 

20 kgcap 

US06 

30 

68 

21 

0.91 

68 

0.94 

180 

blended 

71.9 

56 

38.3 

Si Carb/ 

Compos. 

Mn02 

FUD 

20 

40 

18 

0.94 

40 

0.97 

220 

AE 

none 

99 

52.8 

22 kgbat 

HW 

20 

45 

19 

0.91 

45 

0.97 

225 

AE 

none 

110 

56.8 

20 kgcap 

US06 

30 

68 

21 

0.91 

68 

0.94 

190 

blended 

71.1 

52 

38.4 

Rech. 

Zn-air 

FUD 

40 

45 

19 

0.87 

45 

0.97 

228 

AE 

none 

none 

54.5 

32 kgbat 

HW 

38 

45 

19 

0.81 

45 

0.97 

242 

AE 

none 

none 

57.7 

20 kgcap 

US06 

66 

68 

21 

0.82 

68 

94 

149 

blended 

62.4 

60 

38.8 


(1) Weight of cells only 


Energy Storage: Ultracapacitor. Table 14 Simulation results for the batteries alone 


Battery Type (1) 

Cycle 

Range 

mi. 

kW max. 
control 

kW max 
bat. 

Eff. 

Bat. 

Wh/mi 

Bat. 

Operat. 

mode 

mpg 

20mi. 

mpg 

40mi 

Mpg Ch. 
sus. HEV 

EIG 

NiCoMn 

FUD 

27 

30 

30 

0.94 

125 

blended 

134 

85 

47 

30 kg 

HW 

24 

20 

20 

0.93 

137 

blended 

110 

87 

47 


US06 

57 

58 

58 

0.88 


blended 

48 

45 

37 

Compos. Mn02 

FUD 

36 

30 

30 

0.92 

135 

blended 

134 

104 

46.9 

32 kgbat 

HW 

31 

20 

20 

0.91 

147 

blended 

167 

113 

46.6 


US06 

64 

58 

58 

0.87 

92 

blended 

48 

48 

34.1 

Si Carb/Compos. 
Mn02 

FUD 

35 

30 

30 

0.93 

138 

blended 

138 

106 

46.9 

22 kgbat 

HW 

32 

20 

20 

0.92 

148 

blended 

169 

114 

46.9 


US06 

64 

58 

58 

0.88 

87 

blended 

48 

48 

35.7 

Rech. Zn-air 

FUD 

66 

30 

30 

0.84 

139 

blended 

139 

137 

39.4 

32 kgbat 

HW 

63 

20 

20 

0.83 

156 

blended 

169 

169 

41.1 


US06 

93 

36 

36 

0.72 

101 

blended 

48.5 

48.5 

30.1 


(1) Weight of cells only 
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capability and thus regenerative braking performance 
approaching that of ultracapacitors or even lithium- 
ion batteries. This is another advantage of the use of 
ultracapacitors with the air-electrode batteries. 

Fuel Cell Vehicle Simulations Using 
Ultracapacitors 

Fuel Cell Powertrain Configurations and Power Split 
Strategies 

A fuel cell vehicle powertrain consists of three elements: 
(1) a fuel cell unit that consists of a fuel cell stack, 
air and hydrogen supply, and water and thermal man¬ 
agement systems; (2) an energy storage unit 
(ultracapacitors or batteries) that can store the electric¬ 
ity generated by the fuel cell as needed; (3) an interface 
electronics unit that controls the power split between 
the fuel cell and the energy storage unit. Fuel cells in 
combination with energy storage can create high power 
for vehicle traction with fast dynamic response, effi¬ 
cient capture of regenerative braking energy, and 
reduced stress on the fuel cell due to the elimination 
of high current pulses. Potentially the use of energy 
storage can also permit downsizing the fuel cell unit 
in some applications. As indicated in Fig. 13, there are 
several approaches to hybridizing the fuel cell system. 
These various powertrain configurations for fuel cell 
vehicles have been simulated [28-30] to determine the 



c 


effect of the configuration used on vehicle hydrogen 
consumption and the dynamic stress on the fuel cell 
stack. Of particular interest in this entry are 
powertrains using ultracapacitors in place of batteries. 

The key considerations in configuring the 
powertrain are whether interface electronics is used 
and where the electronics is positioned relative to the 
fuel cell and the energy storage unit. A powertrain 
without interface electronics is possible using 
ultracapacitors, but not with batteries. For fuel cell 
vehicles with ultracapacitors coupled directly in paral¬ 
lel with the fuel cell, a power diode is employed to 
prevent the flow of reverse current during braking 
periods. 

In systems without interface electronics, the power 
sharing between the fuel cell and ultracapacitor is 
determined by the current-voltage characteristics of 
fuel cell and ultracapacitor because the voltages must 
be equal at all times. 

Vcap — Vfc Vdiode 

The voltage response of the fuel cell on various 
driving cycles is modeled in detail in [28-30]. 

Fuel cell vehicles with DC/DC interface electronics 
placed between the fuel cell and energy storage units 
have more flexibility for balancing the power flow 
between the fuel cell and the energy storage. The gen¬ 
eral control objective is to operate the fuel cell unit only 
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Energy Storage: Ultracapacitor. Figure 13 

Powertrain configurations for hybridized fuel cells vehicles 
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in its high efficiency region and as close to constant or 
slowly varying power as possible. Two different control 
strategies, power assist and load leveling, can be 
employed [30]. Both control strategies maintain the 
SOC of the energy storage unit within a specified 
range. The power assist control strategy sets the current 
demand for the fuel cell based on the fuel cell voltage 
and the energy storage SOC. If the fuel cell voltage 
remains relatively high, it provides most of the current 
to the electric drive. When the fuel cell voltage becomes 
low and the energy storage SOC is high, the energy 
storage unit provides a large fraction of the current. In 
the load-leveled control strategy , the fuel cell provides 
relatively steady power and the energy storage unit 
provides transient power. The fuel cell power com¬ 
mand is the vehicle demand averaged over a specified 
time period such as 60 s. 


Simulation Results for Fuel Cell Vehicles 

The performance and fuel consumption of midsize fuel 
cell vehicles were simulated for the different powertrain 
configurations and control strategies using the vehicle 
models developed at UC Davis [28-30]. Simulations 
were performed for the hybrid fuel cell vehicles with 
a dynamic fuel cell system operating in the optimal 
varying back pressure mode. The simulation parame¬ 
ters are listed in Table 15. 

The simulations were performed over the FUDS 
and US06 driving cycles. The simulation results are 
discussed in detail in [30]. It was found that the use 
of energy storage significantly reduced the stress on the 
fuel cell stack by reducing the peak currents as well as 
the current fluctuations. There were no significant dif¬ 
ferences in the stack currents using batteries or 
ultracapacitors if interface electronics was positioned 
between the energy storage unit and the fuel cell bus; 
the results using ultracapacitors without interface elec¬ 
tronics were nearly as advantageous as those with 
electronics. 

The use of energy storage in the fuel cell vehicles 
resulted in improved fuel economy (reduced hydrogen 
consumption). As shown in Figs. 14 and 15, the 
improvements increase for larger energy storage units 
for both ultracapacitors and batteries. In general, the 
magnitudes of the improvements are much smaller 
than in hybrid-electric vehicles. This is due to the 


Energy Storage: Ultracapacitor. Table 15 Vehicle simu¬ 
lation parameters 


Vehicle and system parameters 

Drag coefficient 

0.3 

Frontal area (m 2 ) 

2.2 

Rolling resistance 

0.01 

Vehicle hotel load (kW) 

0.3 

Vehicle mass without energy storage (kg) a 

1,500 

Electric motor (kW) 

75 

Fuel cell stack and auxiliaries 

Max. net power (kW) 

87.6 

Gross power (kW) 

106 

Number of cells 

440 

Cell area (cm 2 ) 

510 

Compressor (kW) 

17.2 


a Weight includes all powertrain components except energy 
storage 


UC coupled to FC DC-Link via a converter 

1.30 


1.25 

® 1.20 
CD 

1 1.15 

CL 

E 

1.10 

E 

§ 1.05 

CD 

1 1.00 
LL 

0.95 


0.90 

60 80 100 120 140 160 180 200 220 

Ultracapacitor capacity (Wh) 

Energy Storage: Ultracapacitor. Figure 14 

Fuel economy improvements using ultracapacitors 



higher efficiency of the fuel cell and its smaller variation 
with power fraction than is the case for engines. Most 
of the improvement in fuel economy for fuel cell vehi¬ 
cles using energy storage is due to energy recovery 
during regenerative braking. Note from Figs. 14 
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Li-ion battery coupled to FC DC-link via a converter 



Battery capacity (Ah) 

Energy Storage: Ultracapacitor. Figure 15 

Fuel economy improvements using lithium batteries 

and 15 that the fuel economy improvements with 
ultracapacitors are slightly larger than with batteries 
primarily due to the higher efficiency (lower losses) of 
the ultracapacitors. 

Economic Considerations 

Reducing the present high cost/price of ultracapacitors 
is a key issue in achieving high market penetration in 
the future especially of midsize and large devices. There 
are many applications for which ultracapacitors are 
presently precluded or even seriously considered 
because they remain too expensive even though their 
selling price has decreased significantly in recent years. 
The cost of manufacture of any product is closely tied 
to production volume with the cost decreasing rapidly 
with increased volume up to relatively high production 
rates. Sales of capacitors in the many millions of units 
per year are necessary to reduce the unit costs to levels 
at which large markets can develop. Semi-automated 
production facilities presently exist at a number of 
companies for ultracapacitors of all sizes. In fact, pro¬ 
duction capabilities exceed sales volumes for most 
devices and that is the reason the price of devices has 
decreased markedly in recent years. It is common to 
speak of the price of devices in terms of cents per Farad 
(cents/F) or $/Wh stored. It is easier to interpret the 
price information on the cents/F basis as it does not 


depend on the cell voltage or what fraction of energy 
stored can be used in a particular application. For 
example, for a 10 F device, if the price is quoted as 
10 cent/F, the device cost would be $1. Similarly, 
a 2,500 F device would cost $25 at 1 cent/F. 

The cost to manufacture a carbon/carbon device 
depends on the material and production costs. At the 
present time, material costs are high. The cost of car¬ 
bon suitable for use in ultracapacitors can be as high as 
$ 100/kg with the average price being in the $30-50/kg 
range. The cost of the electrolyte solvent is also high in 
the range of $5-10/1 for both propylene carbonate and 
acetonitrile. The ionic salts that dissociate in the sol¬ 
vent into the positive and negative ions that move into 
and out of the double layer in the microporous carbon 
to store the energy are also expensive being $50-100/ 
kg. Since the analysis of ultracapacitors is straightfor¬ 
ward, material costs can be calculated [31, 32] with 
good accuracy. The result of a typical costing exercise 
is shown in Table 16. Note the strong dependency of the 
cents/F and $/Wh unit costs for the device on the unit 
material costs. Presently the price of ultracapacitors is 
high because both the material and manufacturing 
costs are high. With more automated production and 
reduced material costs, it is anticipated that the price of 
ECCs in high volume can be in the range of 1-2 cents/F 
for small devices and 0.25-0.5 cents/F for large devices 
like those needed for vehicle applications. 

Ultracapacitors cannot compete with batteries in 
terms of $/Wh, but they can compete in terms of $/kW 
and $/unit to satisfy a particular vehicle application. 
Both energy storage technologies must provide the 
same power and cycle life and sufficient energy (Wh) 
for the application. The weight of the battery is usually 
set by the system power requirement and cycle life and 
not the minimum energy storage requirement. Satis¬ 
fying only the minimum energy storage requirement 
would result in a much smaller, lighter battery than is 
needed to meet the other requirements. On the other 
hand, the weight of the ultracapacitor is determined 
by the minimum energy storage requirement. 
The power and cycle life requirements are usually 
easily satisfied. Hence, the unit can be a more opti¬ 
mum solution for many applications and its weight 
can be less than that of the battery even though 
its energy density is less than one tenth that of the 
battery. 
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Energy Storage: Ultracapacitor. Table 16 Material costs for a 2.7 V, 3,500 F capacitor 3 


Carbon 

gmC/dev. 

$/kg 

Electrolyte ACN 

$/kg salt 

Device 

Unit 

Costs 

$/kW 

Ct/F 

F/gm 

$/i 

Total mat. $ 

$/kg 

$/Wh 

75 

187 

50 

10 

125 

17.0 

29 

6.4 

29 

0.48 

120 

117 

100 

10 

125 

15.5 

26 

6 

26 

0.44 

75 

187 

5 

2 

50 

3.6 

6.0 

1.3 

6 

0.10 

120 

117 

10 

2 

50 

2.5 

4.2 

0.93 

4.2 

0.070 


a 4.5 Wh/kg, 1,000 W/kg-95% eff. 


Energy Storage: Ultracapacitor. Table 17 Relationships between capacitor and battery costs 


Battery cost 
$/kWh 

Battery 
cost 3 $/kW 

Ultracap cost 
cents/F V cap =2.6 

Ultracap cost 
cents/F V cap =3.0 

Ultracap cost b 
$/kWh V cap =3.0 

Ultracap cost $/kW 
Vcap =3-0 

300 

30 

0.25 

0.34 

3,626 

7.3 

400 

40 

0.34 

0.45 

4,800 

9.6 

500 

50 

0.42 

0.56 

5,973 

11.9 

700 

70 

0.59 

0.78 

8,320 

16.6 

900 

90 

0.76 

1.0 

10,667 

21.3 

1,000 

100 

0.84 

1.12 

11,947 

23.9 


a battery 100 Wh/kg, 1,000 W/kg 
b capacitor 5 Wh/kg, 2,500 W/kg 


Consider the example of a charge-sustaining hybrid 
like the Prius. If the energy stored in the capacitor unit 
is 125 Wh and that in the battery unit is 1,500 Wh, the 
unit costs of the capacitors and battery for equal unit 
cost are related by 

($/Wh) cap = ,012($/kWh) bat 

The corresponding capacitor unit costs in terms of 
cents/Farad and $/kWh are given by 

(cents/F) cap = .125 * 10“ 3 * ($/kWh) babf V cap 2 

($/kWh) cap = 9.6 * 10 4 (cents/F) cap /V r 2 

and shown in Table 17 for a range of battery costs. 

The results shown in Table 17 indicate that for the 
charge-sustaining hybrid application, ultracapacitor 
costs of 0.5-1.0 cents/F are competitive with lithium 
battery costs in the range of $500-700/kWh. Note also 
that the $/kW cost of the ultracapacitor are about one 
fourth those of the batteries. 


Future Directions of Ultracapacitor Development 
and Vehicle Applications 

Ultracapacitor Developments 

Ultracapacitors for vehicle applications - that is devices 
having a capacitance greater than several hundred 
Farads - have been under development for over 
20 years. This development is expected to continue 
with improvements in energy density as special carbons 
developed especially for the ultracapacitor appl¬ 
ication become available. Energy densities of about 
10-15 Wh/kg can be expected using the advanced 
carbons and electrolytes permitting voltages of 
3-3.5 V. The cost of ultracapacitors using activated 
carbon are presently 1-2 cents/F. It is expected that 
the cost will continue to decrease as the volume of 
production of ultracapacitors continues to increase. If 
the cost can be reduced to 0.25-0.50 cents/F, the market 
for ultracapacitors in vehicle applications will become 
large (millions of vehicles per year using the devices). 
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Development of ultracapacitors using pseudo- 
capacitance or Faradaic materials in at least one 
electrode will continue. These devices can use either 
activated carbon or an advanced carbon in one of the 
electrode. The energy density of these advanced devices 
could be greater than 20 Wh/kg. The key issues with the 
advanced devices are their power capability (W/kg for 
95% pulse power efficiency) and cycle life. Research to 
date has indicated it is difficult to achieve large 
improvements in energy density without significant 
sacrifices in cycle life and power capability. 

Vehicle Applications of Ultracapacitors 

There have been numerous demonstrations of the use 
of ultracapacitors in vehicle applications, but except for 
a limited number of transit buses and heavy-duty 
trucks produced for sale, there has been no significant 
market for ultracapacitors for hybrid vehicles. This may 
be about to change (2011) as several auto companies 
are discussing the inclusion of ultracapacitors in 
micro-hybrids combined with lead-acid batteries. The 
success of this application is highly dependent on the 
cost of the ultracapacitors and will require a continuing 
decrease in their cost. If the energy density of the 
carbon/carbon devices can be increased to 8-10 Wh/kg 
with continuing cost reductions to 0.5-1.0 cents/F, it 
seems likely ultracapacitors will be used in charge- 
sustaining HEVs to further improve the fuel economy 
of those vehicles. This could be a very large market for 
ultracapacitors. 

Applications of ultracapacitors in vehicles further 
in the future could involve their use in fuel cell- 
powered vehicles and electric and hybrid vehicles 
using advanced, high energy density (>250 Wh/kg) 
lithium batteries, which may not have corresponding 
higher power capability and need ultracapacitors to 
meet vehicle power requirements. These applications 
can be the focus of laboratory studies and field dem¬ 
onstrations in the next 5-10 years. 

Summary 

In this entry, various aspects of the use of ultracapacitors 
alone and in combination with batteries are discussed in 
relation to general system considerations and the perfor¬ 
mance of the energy storage systems in charge-sustaining 
and plug-in hybrid vehicles and fuel cell-powered 


vehicles. The performance of ultracapacitors and lith¬ 
ium batteries is reviewed based on test data taken 
at the University of California-Davis. It was found 
that ultracapacitors are commercially available with 
energy densities of 4.5-5 Wh/kg and power density of 
1-2 kW/kg with a pulse efficiency of 95%. Lithium 
batteries of various chemistries were also tested. It 
was found that batteries consisting of graphite and 
manganese oxide/spinel had energy densities of 
140-160 Wh/kg and power densities of 0.5-1.0 kW/kg 
with pulse efficiencies of 95%. High-power lithium 
batteries using lithium titanate oxide in the negative 
electrode were found to have an energy density of about 
40 Wh/kg and a power density of 1.3 kW/kg with pulse 
efficiencies of 95%. Comparisons of the power densi¬ 
ties of ultracapacitors and lithium batteries indicated 
that the best ultracapacitors had power densities 
a factor of 2-3 higher than the high-power batteries, 
but there are some batteries with higher power capa¬ 
bility than the lower power capability capacitors. For 
use in hybrid vehicles, high power density is very 
important for ultracapacitors. 

The application of ultracapacitors in hybrid-electric 
vehicles was considered in detail utilizing vehicle sim¬ 
ulation results. The use of ultracapacitors alone is ana¬ 
lyzed for micro- and mild/full charge-sustaining hybrids 
and in combination with high-energy density batteries 
for plug-in hybrids. It was found that ultracapacitors 
could be used in place of batteries in the charge- 
sustaining hybrids resulting in fuel economy improve¬ 
ments of 25-50% even for relatively small electric drive 
systems of 6-10 kW. The ultracapacitor units only stored 
50-100 Wh of usable energy and their efficiency was high 
being 95-98%. In all cases, the ultracapacitors were used 
with interface electronics that permitted the use of a large 
fraction of their stored energy. 

Vehicle simulations were also performed for plug-in 
hybrid vehicles that used ultracapacitors combined 
with advanced, high energy density batteries. The bat¬ 
teries had energy densities of 200-400 Wh/kg, but only 
moderate power capability of 300-1,000 W/kg for 90% 
efficient pulses. The simulation results indicated that 
the advanced batteries combined with ultracapacitors 
could provide all-electric operation on the FUDS 
and Federal Highway cycles in the charge-depleting 
mode and excellent fuel economy in the charge- 
sustaining mode. Vehicle operation with the advanced 
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batteries without the ultracapacitors required blended 
operation on all driving cycles. Comparisons of the 
current/voltage/power profiles of the batteries with 
and without the ultracapacitors indicated the peak 
currents and thus the stress on the batteries were 
reduced by about a factor of 3 using the ultracapacitors. 
This reduction is expected to lead to a large increase in 
battery cycle life. 

Vehicle simulation results are also discussed for fuel 
cell vehicles using ultracapacitors or batteries for 
energy storage. It was found that the vehicle and fuel 
cell system functioned essentially the same with 
ultracapacitor or batteries. In the case of fuel cell vehi¬ 
cles, the ultracapacitor unit could be used without 
interface electronics with the resultant power split 
between the ultracapacitors and the fuel cell being 
self-controlled by the voltage-current characteristics 
of the respective devices. The fuel economy (hydrogen 
consumption) improvements using energy storage 
were 10-15% for the FUDS cycle and 5-10% for the 
US06 cycle with ultracapacitors and a few percent less 
using lithium batteries. 

The long-term costs of both lithium batteries and 
ultracapacitors are uncertain. An analysis of 
ultracapacitor unit costs indicated that a battery cost 
of $500/kWh and ultracapacitor cost of 0.5 cents/F 
would result in a comparable energy storage unit cost 
for charge-sustaining hybrid vehicles. 
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Glossary 

Breakthrough time The time taken for a tracing ele¬ 
ment (chemicals) to travel through a specified 
media (a reservoir) indicating a degree of channel¬ 
ing or dispersion of injected fluid passing through 
the reservoir. 

EGS Enhanced or Engineered Geothermal system; a 
geothermal system, a geothermal system engineered 
to mimic a natural system (hydrothermal) so that 
energy can be extracted from hot rocks at great 
depth. 

Granite Granite is a common and widely occurring 
type of intrusive, felsic, igneous rock. Granites usu¬ 
ally have a medium- to coarse-grained texture and 
can contain radiogenic material which produces 
heat in the rock. 

Igneous rocks Igneous rock is formed through the 
cooling and solidification of magma or lava. Igne¬ 
ous rock may form with or without crystallization, 
either below the surface as intrusive (plutonic) 
rocks or on the surface as extrusive (volcanic) rocks. 

In situ stress Forces that exits in the crustal plate and 
the three mutually orthogonal components are cat¬ 
egorized as vertical, minimum horizontal, and 
maximum horizontal. It plays a very important 
part in the development of an EGS reservoir. 

Joint network Term used to collectively describe frac¬ 
tures within a rock mass characterised by features 
such as fracture orientation, density, spacing, length 
and mechanical stiffness. 

Permeability Permeability is defined by as a measure of 
flow through spacing between the two faces of 
a fracture open enough to allow fluid flow through it. 

Reservoir A matrix of fractures are manipulated by 
injecting fluid under pressure for fractures to slide 
against each other, leaving a permanent increased 
aperture in the fracture so that fluid can pass 
through it to recover heat. 

Shear failure A terminology used when two sides of 
a fracture slip over each other with minor perma¬ 
nent displacement. 


E 





3502 


Engineered Geothermal Systems, Development and Sustainability of 



Tensile failure A terminology used to create a new 
fracture by splitting the rock mass using fluid injec¬ 
tion. The fracture remains open while the pressure 
is applied but closes once the pressure is taken off, 
leaving no residual gap between the two new faces 
of the fracture. 

Thermal drawdown A process whereby the reservoir 
is progressively cooled over time by the passing 
through of cooler re-injected fluid. 

Definition of Engineered Geothermal Systems 

Engineered Geothermal System (EGS) is a relatively new 
technology with tremendous potential for providing 
heat and power, as well as helping to address the issue 
of reduction of C0 2 in the environment. The technology 
is complex, and it has taken some time for a series of 
research projects globally to understand the physical 
processes involved, to develop supporting technologies 
such as high temperature instrumentation, numerical 
models, and to validate the concept. In view of EGS 
being a new technology with very limited operating 
history, the issue of sustainability is difficult to address. 
It is important to assess the sustainability issue based on 
the current state of knowledge whilst also recognizing 
the fact that some existing data are inconclusive. Obser¬ 
vations and conclusions reached at many of the EGS sites 
worldwide have been incorporated to give a broad-based 
qualitative assessment and to reduce the bias. 

The most advanced and near commercial-scale EGS 
project is the European project at Soultz-sous-Forets, 
north of Strasbourg in France. Knowledge gained from 
over 30 years of research carried out at other EGS pro¬ 
jects formed the basis upon which the European project 
was built [1, 2]. To a large extent, the preceding 
national EGS projects were at a level of university 
research and subsequently remained starved of funds. 
This made it difficult to demonstrate the true potential 
of EGS. Establishing a European Project [3,4] provided 
a big boost and allowed work to proceed on a realistic 
scale to address the technical and economic targets. The 
first successful circulation of an EGS system was 
achieved at Soultz in 1997, at a depth of around 3,600 
m, and indicated that technically and economically the 
technology was promising [5]. Data from Soultz are 
drawn upon in this paper to support the case why this 
technology needs to move to a commercialization 


scenario, and how it can be scaled up to produce 
a large, sustainable power output to make it acceptable 
as a serious contender as an energy provider. It is 
important to re-emphasize that discussions and argu¬ 
ments for the sustainability of EGS presently remain 
theoretical as there are very few operating plants at 
present. However, our judgement and experience has 
been used to put forward sustainability scenarios based 
on the history of the development of this technology 
and observations to date. 

Introduction 

Global demand for energy is expected to continue to 
rise, but on the timescales of decades to centuries, the 
supply of traditional energy resources derived from 
hydrocarbon are unlikely to keep pace with the increas¬ 
ing demand. Additionally, there is increasing awareness 
of global warming, the political dimension to the gen¬ 
eration of C0 2 , and the importance of domestic energy 
security. Nations, industry, and the public are aware 
that low-emission energy resources will have to play an 
increasingly important role as people both choose and 
are forced to move away from hydrocarbon-based 
energy resources. The effect of digressions between 
the increasing demand and the limited supply of hydro¬ 
carbon-based energy will manifest itself as an increas¬ 
ing price of energy. The increasing prices should, in 
principle, make some renewables cost competitive. 

The energy potentially available from Geothermal 
resources generally, and EGS in particular, is huge; 
almost beyond comprehension. Theoretical calcula¬ 
tions indicate that cooling just 1 km 3 volume of rock 
from 200°C to 180°C (i.e., by 20°C) could provide 
enough thermal energy to power an electric generator 
of 10 MWe capacity for 20 years, the equivalent of 1.3 
million tons of oil. For example, on a slightly larger 
scale (national scale), the estimate of the Technical 
Potential for EGS in Australia is around 6,800 GWe 
[6], which equates to 136 times the total current 
installed capacity in Australia from all energy sources. 

Basic Concept of EGS 

The concept of extracting heat energy from the deep, 
hot rock mass in the earth using EGS technology is 
relatively simple [7]. Drilling to an appropriate depth 
can provide access to any desired temperature, and 
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a reservoir (heat exchanger) can be created by pumping 
cold fluid at high pressure and flow rate from the 
surface to open up existing fractures, a process known 
as “reservoir stimulation”. These fractures remain open 
permanently, giving the stimulated rock mass an 
enhanced permeability. A second, and sometimes 
a third, well is drilled into the heat exchanger some 
distance (typically around 600 m) from the first well. 
Cold water is pumped down the injection well, the 
water flows through the enhanced permeable zone 
(heat exchanger) where it heats up, and the hot water 
is returned to the surface via a production well in order 
to drive a turbine to produce electricity and/or space 
heating. The cooled fluid is then pumped down again 
and reheated, thus forming a continuous closed circuit 
loop (Fig. 1). 

Although this was the initial concept of EGS, the 
term “EGS” now also encompasses the exploitation of 
fluid flow at depth in faults with favorable orientation 
within the prevailing stress field [8, 9]. Experience and 
knowledge to date indicate that some large faults that 
are approximately aligned in the maximum horizontal 
stress direction at depth are likely to be open and are 
able to deliver large flow rates of hot fluid for power 
generation [10]. A production well is drilled to 


orthogonally intersect at depth a large fault that is 
striking in the direction of maximum horizontal stress. 
Significant flow rates might be immediately achievable. 
If not, then stimulations may be necessary to enhance 
the flow rate by decreasing the flow impedance. 
A similar technique is used to drill a second well (injec¬ 
tion well) some distance away and offset from the first 
fault. In planning such a strategy, it is important to 
recognize that stimulation of either or both wells may 
be necessary to decrease the natural impedance. An 
EGS project based on this model has already been 
commercialized in Landau, Germany (www.geox- 
gmbh.de/en). The EGS reservoir has been producing 
for around 3 years with no indication of reduction in 
the flow rate or thermal drawdown. The data from the 
project is not widely circulated due to the commercial 
nature of the project. 

Numerical modeling of the stimulated deep reservoir 
at Soultz reveals that a reservoir of such dimensions 
should last for around 20+ years [11] before thermal 
drawdown reducing the rock temperature below com¬ 
mercial levels. But the results of the numerical models 
are yet to be confirmed through long-term circulation. 

Any, even superficial, estimate of global EGS potential 
reveals a huge store of thermal energy at relatively shallow 
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depths in the Earth s crust. Debate often surrounds issues 
such as whether the depth for estimating EGS potential 
should be limited to 5,000 or 10,000 m. A seminal report 
from the Massachusetts Institute of Technology in the 
USA [12] supported 10,000 m as the maximum depth 
of exploitation for EGS with a view that the technology 
to exploit energy at this depth is within the grasp of the 
hydrocarbon industry and therefore could be adapted 
to work for EGS in the future. 

Global Potential of EGS 

EGS technology has advanced significantly in the last 
20 years. Some of the main questions now raised by 
potential investors are: 

• What is the worldwide potential for EGS? 

• Can present technology be scaled up to produce 
significant amounts of power? 

• What are the risks associated with this technology? 

• Are EGS reservoirs sustainable? 

Taking notice of the concern of potential investors, 
one of the tasks under the auspices of “Geothermal 
Implementing Agreement” of the International Energy 
Agency (IEA/GIA) has been to assess the global poten¬ 
tial for EGS. To this end, the Google organization has 
sponsored a mechanism to quantify the global poten¬ 
tial by developing a Protocol, with the intention that it 
be adopted internationally [13]. Other questions are 
addressed later on in the paper. 

Brief History of the Development of EGS 

EGS projects have been activated in many countries, 
but only four are mentioned here to summarize the 
development of this technology. 

The concept of extracting heat from hot, but low 
permeability, rocks at great depth and converting this 
to electricity came from Los Alamos, USA, in the early 
1970s. The concept was called Hot Dry Rock (HDR), 
and a research project was established in a caldera near 
Los Alamos, New Mexico, USA [14, 15]. The concept 
relied on some basic assumptions at the time; primarily, 
that the system had to be in an igneous basement rock 
with low permeability so that it could be stimulated using 
high-pressure fluid injection to enhance permeability 
with minimal chance of losing water in the far field 
during circulation. The method of enhancing the 


permeability of the potential heat exchanger was 
borrowed from the hydrocarbon industry. It was felt 
that this technique could, in principle, be replicated 
worldwide to provide power and heat wherever 
necessary. 

Although the concept appeared to be relatively sim¬ 
ple, extensive supporting technologies had to be devel¬ 
oped to overcome difficulties such as high temperatures 
encountered during drilling, casing and cementing, 
characterization of the development of the reservoir, 
and establishing the physics associated with it [2]. 

In the mid-1970s, a research project was established 
in the UK at the Rosemanowes site in Cornwall [16] to 
focus on and characterize the physics associated with 
the creation of an underground heat exchanger. The 
project targeted a much shallower depth and lower 
temperature than the Los Alamos site. This was done 
in recognition of the fact that the high temperature and 
complex stresses regime had created insurmountable 
difficulties for Los Alamos. The project at the 
Rosemanowes site was designed so that the physical 
processes involved in the development of an EGS res¬ 
ervoir could be easily observed and understood. 

The Rosemanowes project addressed many of the 
uncertainties in the physical processes involved in the 
creation and characterization of an EGS reservoir, and 
in particular the influence of the prevailing tectonic 
stress regime on the fluid flow and water losses 
[17-19]. A 5-year-long circulation test was also carried 
out (the longest circulation so far in any EGS project), 
stimulations were carried out using various materials 
(explosives, gel with various viscosities, water, etc.). 
Attempts to define a number of economic targets for 
assessing the commercialization potential of the tech¬ 
nology were also carried out [20]. 

By the early 1980s, as the definition and the concept 
of EGS grew broader, it was recognized that “An EGS 
system is any system where reinjection is necessary to 
extract the heat at a commercial rate for a prolonged 
period ” [21]. If one accepts this definition, then a large 
proportion of hydrothermal systems may fall within the 
realms of EGS. Additionally, various names were given to 
the same concept (Hot Dry Rock) for political accep¬ 
tance and because of variations in the in situ rock char¬ 
acteristics. Alternative names include “Enhanced 
Geothermal Systems,” “Hot Fractured Rocks,” and 
“Hot Wet Rocks,” but, recently, all of these have been 
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grouped into a more coherent title, called Engineered 
Geothermal Systems. This recognizes the fact that there 
is not a single solution for creating a reservoir at depth 
due to the natural variation in rock properties, and that 
an engineered solution will need to be found to address 
local conditions based on past experience. 

The Soultz project was established in late 1988 
about 50 km north of Strasbourg in Alsace, France, 
near the German border [3, 22, 23]. The project was 
established as a European project, supported by the 
European Commission, incorporating Britain, France, 
and Germany (with Britain leaving the program 
around 1993). The program at Soultz had an interna¬ 
tional dimension and sufficient funds to take EGS 
technology to the next step in development, i.e., bring 
EGS closer to a commercial prospective by addressing/ 
improving economic targets. The project successfully 
addressed some of the economic targets by initially 
creating an EGS reservoir at 3.5 km depth [24], and 
later at 5,000 m depth. Its international status (Fig. 2) 
helped to accumulate the knowledge gained in earlier 
projects and bring it into focus to address the further 
development required for commercial exploitation 
[24-28]. 

Economic Targets for Commercial Acceptance 

A number of economics models have been generated in 
the past to assess the parameters necessary to make EGS 


commercially viable. One such economic model was 
prepared by Shock [29] for the UK DoE. That study 
identified a number of parameters (Fig. 3) that need to 
be demonstrated and quantified in real terms to gain 
confidence in a created EGS reservoir. These are: 

Quantity of Power Output: 5-6 MWe This depends 
on the combination of output temperature, the deliv¬ 
ered flow rate, and the density of the fluid. For example, 
at 200°C, one would require a flow rate of ~751/s using 
fresh water. The highest temperature accessed to date in 
an EGS project is 278°C at the Geodynamics site in 
Australia [30] . The highest flow rate produced to date is 
~100 1/s at the Landau site, Germany. 

Life of an EGS Reservoir at the Above Flow Rate In 

principle, a reservoir is designed to last around 20 years 
with a temperature drawdown of 10%. For example, if 
the initial temperature at the inlet of a power plant is 
200° C, a temperature drawdown of 10% will bring this 
to 180°C. Once the system reaches around 180°C, the 
plant will be less efficient and is likely to be shut down, 
but this does not mean that there is no heat energy left 
in the rock mass. 

For heat to be extracted efficiently, cold injected 
fluid has to come into contact with the hot rock so 
that heat can be transferred from the rock to the fluid. 
Numerical models suggest that about 1 million square 
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Geothermal Systems, Engineered or Enhanced (EGS), Accumulation and dissemination of knowledge at the European EGS 
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ECONOMIC REQUIREMENTS 


DRIVEN BY ECONOMICS: 

Taraet 5-6 MWe/module 

• 

LIFE OF THE SYSTEM: 

-20 Years 

• 

TEMP/DEPTH OF THE WELLS: 

-200 C 

• 

SEPARATION BETWEEN WELLS: 

-600 m 

• 

PRODUCTION FLOW RATE: 

-75 Ko/s 

• 

FLOW IMPEDANCE: 

0.1 MPa/l/s 

• 

WATER LOSS: 

-10% MAX 

• 

THERMAL DRAWDOWN: 

- 10% 

• 

CONTACT SURFACE AREA: 

-10 million m 2 

• 

RESERVOIR ROCK VOLUME: 

-300 million m 3 

• 

INTEREST RATE FOR THE CAPITAL: 

-5% 

• 

SUPPORT: 

No C0 2 levy support etc 


Economic study by Shock -1986 for UK DoE 
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Main economic targets for EGS (From [29]) 


meters of contact surface area is required between the 
fluid and the rock mass [12]. For an EGS reservoir to 
last 20 years, a numerical model indicates that a total 
rock volume of around 200 million cubic meters is 
required [12]. Numerical modeling also indicates that 
this will necessitate a separation between the wells 
(injection and production) of around 600 m to achieve 
the above rock volume. 

Parasitic Losses These are defined as a reduction in 
the net profit caused by the use of power required by 
various operations. Some of these are associated with 
the power required for pumping the fluid around the 
system and is quantified in MPa/L/s. Economic models 
suggest that 0.1 MPa/l/s is acceptable. Additionally, the 
system may lose some of the injected fluid in the far 
field due to the character of the site. In this case, 
additional water may be required to keep the system 
pressurized. This is normally quoted in a % of the exit 
flow. This additional water has to be paid for and comes 
out of the net profit. This is incorporated in the eco¬ 
nomic model, and around 10% is included as accept¬ 
able losses, but the lesser the amount of fluid losses, the 
better will be the return on the investment. Recent 
development of EGS reservoirs shows that typical 
water losses would be around 1-3%. 


One of the key figures in the exploitation of the 
technology is the interest rate on the borrowed capital. 
This can play an important part in the net return on the 
investment as the construction of the overall plant may 
take anything from 3 to 4 years, depending on the 
depth, number of wells, etc. 

More recently [31, 32], revisited economic model¬ 
ing of a “Soultz-like” project. One of their key findings 
was that parasitic pumping power significantly impacts 
on the net power output from an EGS plant (see also 
section Parasitic Losses below). Higher flow rates 
require greater pumping power, and there will typically 
be an optimal pumping/flow rate to maximize the 
economic return of the system. The optimal 
pumping/flow rate might not necessarily be the maxi¬ 
mum achievable flow rate. 

Technical and Economic Achievements to Date 

Reservoir Creation Mechanism The initial focus was 
on applying the hydrofrac technique used in the hydro¬ 
carbon industry for creating an EGS reservoir. This 
consisted of creating a penny-shaped single new frac¬ 
ture (tensile) by injecting fluid under high pressure 
[33-35]. Subsequently, it was recognized that this was 
not the way forward as a penny-shaped single fracture 
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does not create sufficient heat transfer area nor the 
fractured volume required for sustained heat extraction 
at high temperature for prolonged period. 

Observations during the creation of EGS reservoirs 
showed that the reservoir in a natural-jointed rock 
mass grows by shear failure on existing fractures 
[17, 18]. Shear failure occurs when the pore pressure 
in the fracture increases and reaches the value 
of normal stress on the fracture, causing the fracture 
surfaces to part and for shear failure to take place. 
After shear failure, joints are propped open by asperi¬ 
ties. Observations before and after stimulation at 
Soultz demonstrated an increase in “productivity 
index” of wells GPK2 and GPK4 by a factor of ~20 
[36]. This, in turn, effectively reduces the pumping 
power required to circulate fluid through a system, as 
well as providing increased heat transfer area and frac¬ 
tured rock volume. Additionally, the shear mechanism 
produces microseismic noise, which is used to monitor 
the influence of stimulation in real time [18]. 

Size and Lifespan of the Reservoir The separation 
between the injection well and the production well, to 
a large extent, determines the size of the reservoir. 
Initially, this was in the range of 120-180 m because 
of a concern that hydrofracturing in granite would be 
difficult and that it might not be possible to link two 
wells hydraulically over many hundreds of meter. Fol¬ 
lowing the change in the concept of reservoir creation 
mechanism (from predominant tensile to shear fail¬ 
ure), a separation in excess of 600 m between the 
injection and production well is regarded as normal. 
The largest separation of wells at Soultz is 670 m 
[37-40], and over 1,000 m at the EGS project in 
Landau, Germany. Numerical modeling indicates that 
with a separation of this order, an EGS reservoir ought 
to last over 20 years [41] and that using three wells 
instead of two (i.e., one injector and two producers) 
may extend the life even longer. However, no EGS 
reservoir has yet been circulated this long for 
confirmation. 

One of the methods used for assessing the size of an 
EGS reservoir is by using tracers [42]. Breakthrough 
time and model volume are good indicators. 

At Soultz, it was felt that the required heat transfer 
and rock volume were achieved between GPK2 and 
GPK3, but not between GPK3 and GPK4. The poor 


connection between GPK3 and GPK4 was due to 
a poor stimulation strategy [38, 39], further explana¬ 
tion for this is given in section Parasitic Losses. 

Parasitic Losses This was a big issue at the early stages 
of the development of EGS technology. One of the main 
energy losses is due to the power required to pump the 
fluid through the reservoir to collect the heat energy. 
This resistance to flow (impedance) is quantified as 
pressure required to flow per liter per second (MPa/l/s). 
In early days, until 1990, the best impedance achieved was 
around 0.4 MPa/l/s with a flow rate of 15 L/s with 
a separation between wells of around 200 m at the 
Rosemanowes site in the UK. At Soultz, this was reduced 
to ~0.23 MPa/l/s with a separation of 450 m between 
wells and a production flow of ~26 1/s. This EGS reser¬ 
voir was at 3.6 km depth. It was felt that the impedance 
and flow rate could be improved further by having a three 
well system (one injector and two producers). Subse¬ 
quently, three more wells (GPK2, GPK3 and GPK4) 
were drilled to 5,000 m depth to access ^200° C rocks, 
with a separation in excess of 600 m. Following stimula¬ 
tions of GPK2/GPK3, the flow impedance was found to 
be around 0.29 MPa/l/s, thus for three wells one would 
expect to half this to around 0.15 MPa/l/s [37, 38] . GPK4 
was also stimulated, but a reduction in impedance was 
not achieved because GPK4 was poorly connected to 
GPK3, most likely, due to a change in the strategy for 
stimulation [38]. 

Stimulation between GPK2 and GPK3 was carried 
out using a newly developed stimulation technique 
[38] that involved injecting in both wells simulta¬ 
neously (“focused stimulation”). GPK4 was stimulated 
by injection in one well only. Additionally, there was an 
indication that the previous stimulation in GPK3 may 
have created a stress barrier between GPK3 and GPK4. 
High pressures would have been required to break 
through this barrier, which might have been possible 
using focused stimulation [38] but was not possible by 
stimulating just one well. 

Tracer tests indicated a breakthrough time 
between GPK3 and GPK2 in the order of 3.5 days, 
and between GPK3 and GPK4 of 28 days [42]. 

Power Output In view of the poor hydraulic connec¬ 
tion between the wells GPK3 and GPK4 at Soultz, the 
flow rate achieved was lower than expected and 
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therefore the full potential of the system was not real¬ 
ized. Originally, it was planned to produce around 
30-35 1/s from each production well, giving a total 
flow of around 60-70 1/s and power generation of 
3-4 MWe. This proved impossible due to poor connec¬ 
tion between GPK3 and GPK4. The plan at present is to 
produce around 2 MWe and gradually develop further 
to increase the flow rate and characterize the existing 
reservoir during the circulation period [36]. 

On the other hand, the EGS project at Landau 
increased the flow rate by stimulation from 50 to 
~100 1/s, and power production of between 2.8 
and 3.5 MWe has been achieved (personal 
communication). 

Future Scenarios Which May Lead to a Mature EGS 
Technology 

EGS has distinct advantages over other renewable 
energy sources in that it can deliver a base load or 
a peak load, it is a strategic (domestic) resource, has 
low visual impact, and has a very low carbon foot print 
(Fig. 4) in the range of 10 g C0 2 /kWh. 

Some of the disadvantages are finding the right 
environment for EGS development, the high cost of 
developing this technology, uncertainty about under¬ 
ground rock properties (although this is dramatically 
reduced after the drilling of the first well and obtaining 
the in situ properties), and community concern over 


Mitigation potential of geothermal power plants in the world based 
on data of Table 6 and assumptions for emission of 120 g C0 2 /kWh 
for today and 10 g C0 2 /kWh for future technology (EGS). 
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C0 2 Emission from various energy resources 


generating seismic events (although experience sug¬ 
gests that this can be managed and mitigated). For 
example, the EGS project in Basel, Switzerland 
(DHM), generated a seismic event of the magnitude 
3.4 while injecting to create a reservoir. The conse¬ 
quence was the shutdown of the project due to com¬ 
plaints of the local residents and consequence 
associated with it. This could have been avoided by 
taking the guide line provided by the protocol on 
induced seismicity [43] and a white paper published 
by IEA/GIA [44]. One of the recommendations in the 
protocol is to avoid establishing an EGS project in 
the area where there has been large earthquakes in the 
past or such continuous activities. In 1356, the city of 
Basel was almost entirely destroyed by a 6.5-magnitude 
earthquake. Basel is a known moderately active earth¬ 
quake zone in Switzerland [45] and a possible way 
forward would have been to explain the situation to 
local residents, seek their agreement to go forward, and 
limit the stimulation to smaller volume. 

Overall, the advantages of EGS appear to far out¬ 
weigh the disadvantages, providing a power stimulus 
for continued development of the technology. Uptake 
will be progressive, starting with the technically and 
economically easiest styles of development and 
progressing to more challenging environments. 

It is becoming apparent for the initial rapid take-up 
of this technology the easiest targets are large faults with 
high permeability, as at Landau, Germany. Observations 
at Soultz and other places have indicated that large faults 
often exist at great depth in granite basement, and those 
that are roughly aligned with the maximum horizontal 
stress orientation can potentially deliver increased geo¬ 
thermal fluid production by boosting the bulk perme¬ 
ability and fluid storage of a production zone. However, 
the hydromechanical properties of faults are inherently 
heterogeneous and anisotropic, making it challenging to 
distinguish between permeable and impermeable faults 
prior to drilling and testing. The probability of locating 
zones of enhanced fault permeability can be improved by 
preliminary fault stress state modeling. Such modeling 
can reduce the uncertainty and risk of exploring for 
fault-related geothermal targets, particularly in early 
stage exploration projects or in areas of unknown or 
complex geology [46]. 

One recent development has been the realization by 
the conventional geothermal industry that the heat 






Engineered Geothermal Systems, Development and Sustainability of 


E 


3509 


available in the vicinity of hydrothermal systems is 
significantly greater than what is produced from aqui¬ 
fers within those fields. The knowledge acquired from 
the development of EGS is being applied to extend 
existing hydrothermal fields to access much larger vol¬ 
umes of hot rock mass to contribute to the power 
production. Approaches include stimulations of 
existing dry wells and drilling new wells to connect to 
the hydrothermal reservoir, provided the wells are 
located in a favorable direction with respect to the 
local stress regime. ORMAT Technologies is carrying 
out pioneering work in this field at Desert Peak near 
Reno, USA. There is tremendous potential for the take- 
up of this technology once the concept is demonstrated. 

The present economic conditions in many parts of 
the world mean that a combination of heat and power 
is likely to generate higher revenue for EGS plants 
rather than just power alone. This has been the policy 
pursued by some EGS companies, e.g., EGS Energy Ltd 
(www. egs-energy.com). The use and distribution of 
heat may not be applicable everywhere, but a serious 
consideration is essential to increase revenue and eco¬ 
nomic sustainability of projects. 

Around 3-6 MWe EGS power plants are a starting 
point in the initial take-up of the technology, but 
much larger power generating capacity (25-50 MWe) 
must be demonstrated before the technology can be 
considered mature and acceptable as a big power 
player. In order to achieve such a goal, a concept put 
forward by EGS Energy Ltd, is to successfully replicate 
EGS reservoirs again and again. Taking the example of 
the achievement at Soultz and assuming that the 


connection between GPK3 and GPK4 could have had 
similar characteristics to that between GPK2 and 
GPK3, the module of one injector and two producers 
can be replicated again and again in parallel and serial 
configuration. Each module could produce around 
5 MWe, so three stacked in series could give 
~15 MWe (Fig. 5). Three 15 MWe systems stacked in 
parallel would produce ~45 MWe. This configuration 
could be extended/developed to generate a larger 
power the range of up to 100 MWe or more. An impor¬ 
tant criterion in developing such strategy is the unifor¬ 
mity of stress and thermal gradient so that once 
a prototype with appropriate characteristics has been 
developed and tested, it could be replicated with rela¬ 
tive confidence. Minor occasional variations in charac¬ 
teristics can be rectified using an appropriate tool kit of 
responses as necessary. 

Sustainability of EGS Systems 

Sustainability can be briefly defined as “the capacity to 
endure, hold up or maintain.” In the current atmo¬ 
sphere of community concern for the environment, 
the question of sustainability arises for any technology 
that is being developed for the future. For EGS, sus¬ 
tainability could relate to maintaining the temperature 
and pressure in the reservoir, the electricity output 
from the power plant, the economic return on invest¬ 
ment, the lifespan of surface plant, or other measures. 
At present, the question of long-term sustainability for 
EGS is virtually impossible to quantify. Only one EGS 
commercial system is in existence, and this has been 
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Scaling up scenarios for producing 15-130 MWe (from EGS Energy Ltd) 
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operating only in the last few years. The question of 
sustainability still arises, however, and one can possibly 
address this at present through preliminary observa¬ 
tions [47], numerical modeling, and value judgment 
from working in the development of such systems for 
over 30 years. 

In an EGS power plant, one assumes the term 
“endurance” is aimed at the available thermal energy 
in the reservoir being able to recharge itself in 
a sufficient time to be used again in the future, and 
not associated with the mechanical items such as the 
casing of the well or the power plant. Sustainability 
questions will be raised when attempts are made to 
improve or extend the life of an existing EGS reservoir 
to conserve resource and/or improve profitability. 

For example, calculations suggest that the recovery 
of the depleted heat in a fractured rock reservoir may 
take around 75 years to recover by the natural conduc¬ 
tion process from the surrounding rocks [12]. This 
assumes that no convection process is involved, and 
all the heat transfer is by conduction. In order to extract 
heat from the upper crust using EGS methods, existing 
fractures must be opened up by a shearing mechanism 
to improve fluid flow. In this case, the fractured volume 
may internally act as convective cells [48] . A convection 


process is more efficient in transferring heat, and if this 
is the case, then it is possible that a depleted EGS 
reservoir might thermal recover in a shorter time than 
by conduction alone. 

One of the observations during the 5-year circula¬ 
tion at the Rosemanowes site was that as the center part 
of the reservoir was cooling [49] , causing a shrinking of 
the rock mass, the outer part of the reservoir was being 
opened up and played a greater part in the extraction of 
heat by the creation of new flow paths. This was con¬ 
cluded from tracer tests that suggested the generation 
of new flow paths. This suggests that if one manages the 
reservoir correctly, then there is a good possibility that 
additional channels could open up through time and 
thus improve sustainability. 

Experience at Soultz demonstrates that a number of 
reservoirs can be created within the same granite body, 
above or beside each other (Fig. 6), depending on the 
stress regime. The upper reservoir (3,600 m) at Soultz was 
circulated initially for a while and was then sealed off. 
A new reservoir was created below the first at 5,000 m 
depth. This same concept could be used to create not just 
two but three EGS reservoirs, one stacked above the 
other. The bottom reservoir (system 3) would be 
exploited first. When it had been sufficiently depleted, 
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Stacking up of EGS reservoirs (an example from Soultz and a conceptual model) 
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the reservoir would be isolated and the one above it 
opened and used for generating power. Once the second 
one is depleted, then that would also be isolated and the 
upper reservoir would be used. Assuming that each res¬ 
ervoir lasted for around 25 years, by the time the upper 
reservoir is depleted, the bottom reservoir would be 
recharged to almost the original temperature through 
conduction from the surrounding rocks. This suggests 
that the wells have to be completed in such a way that 
a number of EGS reservoirs can be created one above the 
other, such that any one can be isolated to operate inde¬ 
pendently of the others. A mechanism like this may make 
it possible for EGS reservoirs to be sustainable for 
a century or more. 

Although EGS projects can offer a viable alternative 
renewable energy source, they should not be considered 
in isolation from other local constraints. For example, 
the creation and operation of an EGS project using 
a water-cooled power plant requires a local reliable 
source of water at reasonable cost. Furthermore, EGS 
field projects carried out throughout the world have 
experienced the impact of water losses through trial 
production testing. Controlling water losses is an 
important aspect of sustainability for a geothermal 
energy project particularly in areas of restricted water 
availability such as Geodynamic’s Habanero project 
located in arid outback Australia. Water losses can 
have significantly negative economic and environmen¬ 
tal impacts if not managed. Techniques proposed to 
control water losses include minimizing injection pres¬ 
sures to control reservoir growth during circulation, 
the use of injection proppants, and improving hydrau¬ 
lic connectivity between wells through the use of tech¬ 
niques such as “focused stimulation” [12, 50]. 

In hydrothermal systems, operators need to manage 
the system not just to prolong its life but also to make it 
economically attractive. This requires characterization 
of the reservoir using tracers to monitor the change in 
the reservoir properties and evaluation of the geochem¬ 
istry to manage dissolution and precipitation effects. 
Geochemistry aspects associated with EGS reservoirs 
are still uncertain and not yet quantified [51, 52]. This 
is again a function of there being no long-term operat¬ 
ing plants yet. Some of the experiments done at the 
Soultz site during various circulation tests suggest that 
dissolution, precipitation, and corrosion may be 


controlled by controlling the pH of the circulating 
fluid, maintaining back pressure on the production 
flow, introducing anti-corrosion agents in the system, 
and by pumping the production fluid back in to the 
formation without exposing the fluid to the open 
environment. 

There has been some concern in the past that pref¬ 
erential flow paths may develop as the reservoir cools, 
and that this may cause the abandonment of a reservoir 
earlier than planned. This to some degree has been 
addressed by increasing the distances between injector 
and producer wells from ~200 m to over 600 m. The 
experience at Soultz suggests that these distances may 
be further increased to over 1,000 m if one uses 
a focused stimulation method. Focused stimulation 
allows the pressure to build up rapidly between the 
wells to shear existing joints at distances some hun¬ 
dreds of meters from any one well, and also takes 
significantly less time and volume of water. The result 
is a more consistent system of stimulated joints 
throughout the rock volume, and hence a lower prob¬ 
ability of short circuits. Additionally, tracer tests car¬ 
ried out regularly in the reservoir would give an early 
indication of the development of preferential flow 
paths (short circuits), and then methods could be 
employed to seal them using various chemicals. Such 
sealing agents (calcite based) are currently being devel¬ 
oped to seal a path temporarily or permanently. This, in 
principle, should help to extend the life and sustain¬ 
ability of a reservoir. 

As mentioned earlier, sustainability is an issue that 
cannot completely be quantified until there are 
a number of operating EGS reservoirs for a long time, 
but the indications are that there is every probability 
that most issues affecting sustainability can be man¬ 
aged to provide an optimal return on investment. 

Possible Road Map for EGS 

There is a tremendous quantity of heat stored in the 
Earth (~10 13 EJ, [47]), predominantly from the decay 
of natural radioisotopes. Some of this heat resource is 
being successfully exploited all over the world at the 
tectonic boundaries where the crust is thin. These 
resources have, to a great extent, large volumes of 
very hot fluid trapped underground and are accessed 
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by drilling in to them (Iceland, New Zealand, etc.). 
These resources are classified as a hydrothermal 
resource, but there are extensive areas of the world 
where the crust is not so thin, but extensive heat energy 
is available at greater depth. 

EGS technology provides a mechanism to exploit 
this available heat resource at great depth by drilling in 
to it and creating an artificial reservoir by pumping 
fluid under pressure, opening up natural fractures to 
allow cold fluid to flow through it, and extract the heat 
to drive turbines at the surface. Only recently the tech¬ 
nological and financial support was available to create 
such a system with relevant parameters to make it 
commercially viable. This was achieved under an 
international research environment which facilitated 
funds and scientists to address these problems and 
solve them. 

International Energy Agency has recognized the 
potential of EGS worldwide and the cutting edge geo¬ 
thermal research required to make it successful. IEA has 
therefore set up a team to formulate a road map to 
assist the exploitation of this technology. 

I suspect some of the aspects covered for the road 
map will be: 

• What geophysical technique will help in selecting an 
ideal EGS site? 

• How can one determine stress values and gradient 
in that specific area? 

• How can one encourage joint international research 
to address some of the uncertainties, such as the 
behavior of an EGS reservoir in different stress 
regimes; how long will the reservoir last, etc. 

• Can one support new drilling techniques to reduce 
the cost of drilling? Drilling can be over half of the 
cost of development of an EGS Project 

• Facilitate a procedure of fast tracking for planning 
permission to reduce effective cost. 

• Understand or develop mitigating methods to 
reduce large induced seismic events in EGS plants. 

• Provide a mechanism such as insurance scheme to 
assist in any large drilling accidents. 

• Long-term international research on the behavior 
of EGS reservoir and the control of likely geochem¬ 
ical, dissolution, and precipitation problems. 

• Improve binary power plants to improve power 
conversion efficiency and reliability. 


Future Directions 

EGS is has gone through the fundamental research to 
scientific demonstration stage, and it is now ready for 
application in commercial environment, but, it does 
need supporting financial incentive to encourage 
potential investors. 

These can be: 

• A feed in tariff similar or better than that set by 
Germany will encourage potential investors to be 
prepared to take some risk. 

• Additionally, the potential of being able to provide 
an abundant supply of hot water from an EGS plant 
has not been taken seriously, particularly as it can 
provide this 24 h a day and almost 95% of the time 
in a year. 

• Environment friendly premium, as it has virtually 
no C0 2 emissions or any major pollutants. 

• A separate feed in tariff for the supply of hot water. 

• EGS has a potential for becoming a provider of 
large-scale power (200MWe +) and heat. This can 
become a strategic power base and additional finan¬ 
cial support from government will help investors to 
expand in this direction. 

• Insurance schemes to protect against unforeseen 
accidents during drilling. 

• Establishment of fast tracking method in obtaining 
planning and fulfilling local regulations. This can 
take a long time and can become a major burden to 
investors at an early stage of development. 
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Glossary (According to the IAEA nuclear safety 
glossary [1]) 

Acceptable limit A limit acceptable to the relevant 
regulatory body. 

Accident Any unintended event, including operating 
errors, equipment failures or other mishaps, the 
consequences or potential consequences of which 
are not negligible from the point of view of protec¬ 
tion or safety. 

Action level The level of dose rate or activity concen¬ 
tration above which remedial actions or protective 


actions should be carried out in chronic exposure 
or emergency exposure situations. 

Contamination Radioactive substances on surfaces, 
or within solids, liquids, or gases (including the 
human body), where their presence is unintended 
or undesirable, or the process giving rise to their 
presence in such places. 

Cost-benefit analysis A systematic economic evalua¬ 
tion of the positive effects (benefits) and negative 
effects (disbenefits, including monetary costs) of 
undertaking an action. 

Countermeasure An action aimed at alleviating the 
radiological consequences of an accident. 

Decommissioning Administrative and technical 
actions taken to allow the removal of some or all 
of the regulatory controls from a facility. 

Decontamination The complete or partial removal of 
contamination by a deliberate physical, chemical, 
or biological process. 

Dose A measure of the energy deposited by radiation 
in a target. 

Emergency A nonroutine situation or event that 
necessitates prompt action, primarily to mitigate 
a hazard or adverse consequences for human health 
and safety, quality of life, property, or the 
environment. 

Evacuation The rapid, temporary removal of people 
from an area to avoid or reduce short-term radia¬ 
tion exposure in an emergency. 

Exemption level A value established by a regulatory 
body and expressed in terms of activity concentra¬ 
tion, total activity, dose rate, or radiation energy, at 
or below which a source of radiation may be 
granted exemption from regulatory control without 
further consideration. 

Exposure pathway A route by which radiation or 
radionuclides can reach humans and cause 
exposure. 

Exposure The act or condition of being subject to 
irradiation. 

Intervention Any action intended to reduce or avert 
exposure or the likelihood of exposure to sources 
that are not part of a controlled practice or that are 
out of control as a consequence of an accident. 

Justification (for intervention) The process of deter¬ 
mining whether a proposed intervention is likely, 
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overall, to be beneficial, as required by ICRP’s Sys¬ 
tem of Radiological Protection , i.e., whether the ben¬ 
efits to individuals and to society (including the 
reduction in radiation detriment) from introducing 
or continuing the intervention outweigh the cost of 
the intervention and any harm or damage caused by 
the intervention. 

Protective action An intervention intended to avoid 
or reduce doses to members of the public in emer¬ 
gencies or situations of chronic exposure. 

Reference level An action level, intervention level, 
investigation level, or a recording level. 

Relocation The nonurgent removal or extended exclu¬ 
sion of people from a contaminated area to avoid 
chronic exposure. 

Remedial action Action taken when a specified action 
level is exceeded, to reduce radiation doses that 
might otherwise be received, in an intervention 
situation involving chronic exposure. 

Remediation Any measures that is carried out to 
reduce the radiation exposure from existing con¬ 
tamination of land areas through actions applied to 
the contamination itself (the source) or to the expo¬ 
sure pathways to humans. 

Risk A multi-attribute quantity expressing hazard, 
danger, or chance of harmful or injurious conse¬ 
quences associated with actual or potential exposures. 
It relates to quantities such as the probability that 
specific deleterious consequences may arise and the 
magnitude and character of such consequences. 

Risk assessment Assessment of the radiological risks 
associated with normal operation and possible acci¬ 
dents involving a source or practice. 

Site evaluation Analysis of those factors at a site that 
could affect the safety of a facility or activity on 
that site. 

Waste management All administrative and opera¬ 
tional activities involved in the handling, 
pretreatment, treatment, conditioning, transport, 
storage, and disposal of radioactive waste. 

Definition of the Subject and Its Importance 

There are a large number of sites around the world 

contaminated with radionuclides which, to varying 

extents, require attention. Some of these sites were con¬ 
taminated by nuclear testing or radiation accidents while 


others became contaminated as a result of authorized 
discharges (e.g., hospitals) or inadequate waste manage¬ 
ment and/or waste disposal practices. In recent years, 
problems associated with radioactively contaminated 
sites have led to considerable concern amongst both 
the general public and decision makers. To a large extent, 
the concern has been stimulated by the occurrence of 
several radiation accidents, most notably the Chernobyl 
accident. These issues have also gained a particular 
importance because of an increased public concern 
about the safety of radioactive waste disposal and the 
associated justification of a sustainable development of 
nuclear power. 

The first stage of any strategy responding to con¬ 
tamination is an evaluation of the need for action. 
Decisions to use any actions to protect the population 
or remediate the environment are normally based on 
a comprehensive site evaluation and justification of the 
intervention, if necessary. This analysis should then be 
able to identify appropriate management options. 

Many factors govern the effectiveness and cost of 
countermeasures or remedial actions and need to be 
evaluated to identify a suitable strategy. The contribu¬ 
tions of these factors are site specific, varying with 
the characteristics of the contamination and the receiv¬ 
ing environment as well as on factors such as local and 
national priorities for remediation planning. 

Environmental restoration is often considered as a 
system of actions having a final goal to return a site to 
the conditions that prevailed before the contamination. 
However, in practice, this is often not feasible, espe¬ 
cially if vast areas are affected and require a response. 

Planning of environmental countermeasures or 
remedial actions is a complicated, multidimensional 
task which needs to take account of radiological, 
ecological, and socioeconomic issues within a holistic 
approach. The challenge is to identify suitable, 
effective, individual actions or procedures (such as 
remediation technologies, communication strategies, 
or decision support technologies) within a holistic 
consideration of the entire range of available counter¬ 
measures and remediation actions. 

Introduction 

Contamination of the environment to an extent that 
requires countermeasures or remediation may occur 
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due to many different reasons. Obvious sources of 
radioactive contamination which may require the use 
of countermeasures include radiation accidents or inci¬ 
dents, but remediation may also be needed for radia¬ 
tion legacy sites [2-7]. Radioactive contamination can 
be caused inadvertently by human activities involving 
processes in which natural radionuclides can become 
concentrated in areas not normally controlled by reg¬ 
ulatory bodies. Such activities can include conventional 
mining and processing of ores, production of 
phosphogypsum, etc. [8]. There are also many other 
sources of radioactive contamination such as the appli¬ 
cation of radionuclides in industry, research, and med¬ 
icine. The responses to various accidents and existence 
of some highly contaminated legacy sites have required 
the development of a variety of countermeasure and 
remediation management options to protect human 
health or to remediate the environment. 

According to the current international terminology, 
contamination refers to radioactive substances which 
occur on surfaces, or within solids, liquids, or gases 
(including the human body), where their presence is 
unintended or undesirable [1]. However, almost all 
human activities are accompanied by the release of 
contaminants into the environment and such releases 
are normally under strict regulatory frameworks. 

The terms countermeasure and remediation are fre¬ 
quently used, but often applied differently by various 
authors. Such a situation can often lead to misinterpre¬ 
tations in descriptions of the activities applied to 
decrease radiation impact on the public and the envi¬ 
ronment in an emergency or in exposure situations 
when contamination is already present. To clarify the 
issue, the terms here are used consistently in the fol¬ 
lowing way. Management options which have been 
applied in the case of emergency are countermeasures, 
and they can include both protective and remedial 
actions [1]. However, the same management options 
when applied in the situation of existing exposure are 
considered remedial actions. Existing exposure in this 
situation includes options applied when an emergency 
exposure situation has been declared to have ended or 
in areas contaminated by residual radioactive material 
from past practices, and are part of a remediation pro¬ 
cess. The more informal term environmental counter¬ 
measure is also used elsewhere for management options 
used as remedial actions. 


There is no definitive set time period when an option 
will change from a countermeasure to a remediation in 
the case of emergencies. The transition from an emer¬ 
gency exposure situation to a subsequent existing expo¬ 
sure is characterized by a change in management, from 
countermeasure strategies mainly driven by urgency, 
with potentially high levels of exposures and predomi¬ 
nantly central decisions to more decentralized remedia¬ 
tion strategies aiming to improve living conditions and 
reduce exposures as low as reasonably achievable given 
the circumstances [9]. These strategies must take into 
account the long-term dimension of the situation with 
the possible direct involvement of the exposed individ¬ 
uals in their own protection [10]. 

There are some management option countermea¬ 
sures such as evacuation or application of iodine tablets, 
which can be considered only as protection actions and 
never as remedial actions since they are only used in the 
acute or early phase of an emergency. Conversely, some 
management options can be used both as countermea¬ 
sures and remedial actions, depending on the situation 
or interpretation of a purpose of the application, such as 
clean feeding of animals to decontaminate them. 

A wide range of countermeasures and remedial 
actions have been developed, tested, and implemented 
in contaminated areas. As a result, a large amount of 
data on their effectiveness has been generated, together 
with information on ancillary factors such as the required 
resources and costs. These measures vary considerably in 
their effectiveness, cost, and practicability and this differ¬ 
ence has led to a need to develop a variety of approaches 
and adequate decision-aiding technologies and Environ¬ 
mental Decision Support Systems (EDSSs) that can be 
effectively used for countermeasure or remediation plan¬ 
ning and optimized application of such actions [11-13]. 

The main sources of the data on the effectiveness 
of management options has been the application of 
countermeasures in areas affected by major radiation 
accidents, remediation of nuclear testing sites, and 
areas affected by nuclear facilities or other human 
nuclear activities such as the use of radionuclides for 
medical, research, or industrial purposes. Analysis of the 
lessons learned from these case studies has been invalu¬ 
able in demonstrating that the efficiency of protection of 
management options depends on many factors, includ¬ 
ing environmental conditions, properties of the radio¬ 
nuclides, management on the affected areas, and 
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response from the local population and stakeholders 
[11, 14-17]. This leads to a need to adopt a holistic 
approach, the application of which is to be facilitated 
by modern regulatory frameworks for countermeasures 
and remediation of contaminated areas [10], especially 
with a target of sustainable development of the 
environment. 

Origin and Characteristics of Contaminants 
in the Environment 

There are many different sources of radioactive sub¬ 
stances in the environment. At present, the most impor¬ 
tant sources which have resulted (or which still result) in 
the need for the application of countermeasures or 
remediation can be divided into two main groups. The 
first group can be termed radiation legacy : this group 
includes military test explosions, radiation accidents, 
nuclear weapon accidents, early design nuclear fuel 
cycle facilities, and nuclear applications with inade¬ 
quate operational and disposal practices [2, 4]. In 
some of these cases, the application of countermeasures 
was necessary to protect the population of the affected 
areas before remediation of these sites was considered. 

The second group consists of existing human activ¬ 
ities including uranium mining, processing of ore, fuel 
fabrication, energy generation, reprocessing of nuclear 
fuel, and various other applications of radionuclides in 
industry, research, and medicine. For all these activities 
countermeasures are not required because the 
corresponding discharges of radionuclides into the envi¬ 
ronment or the measured concentrations of radionu¬ 
clides in the environment are below the acceptable 
limits. However, remediation is a planned and necessary 
part of the decommissioning of these facilities and the 
sites should be restored to the status which (under 
existing regulations) allows release of the site to 
unrestricted use (ideally) or for some specific purposes 
(restricted use) if the first option is not feasible. 

Radiation Legacy Sites 

Most sites requiring large-scale remediation are part of 
a “radiation legacy” [4]. Radionuclides of concern at 
such sites have a long physical half-life, high radiolog¬ 
ical toxicity, and high mobility in the environment or 
are elements (including analogs) which have an essen¬ 
tial role in the biosphere [18]. 


During the period 1940-1970, many sites were 
heavily contaminated because of the prevailing radioac¬ 
tive waste disposal practices at facilities for nuclear 
weapons material production. In particular, large quan¬ 
tities of radioactive liquid wastes, sometimes mixed with 
other chemicals, were commonly disposed of directly 
into the ground, surface water bodies, or marine systems 
(for instance at the Mayak, Hanford, and Sellafield pro¬ 
duction facilities). These disposal practices led, in some 
instances, to serious environmental problems which 
required, or still require, remediation [2, 4, 17]. 

Another major legacy group arose from nuclear 
explosives, which have been detonated above ground, 
on the surface, and below ground at many test sites 
around the world. Nuclear weapons accidents have 
occurred around the world since the 1950s. Severe 
accidents include the losses of nuclear weapons from 
aircraft at Palomares (Spain) in 1966 and Thule in 
Greenland in 1968 [17, 19-21]. 

Additionally, about 100 civilian peaceful nuclear 
explosives have been performed in the former Soviet 
Union countries and in the USA for different purposes 
(constructions of dam, gas reservoirs, etc.). Although 
the contribution of these explosives to the global dose 
was very low [5], in some local areas contamination of 
the environment has required application of remedial 
actions [6, 22, 23]. 

Severe radiation accidents at the Mayak radiochem¬ 
ical plant (Kyshtym accident) in 1957 and the Cherno¬ 
byl nuclear power plant in 1986 have led to significant 
contamination by long-lived radionuclides over many 
years, requiring long-term remediation [17, 24-26] 
after the acute phase. 

In some cases, either through accidents or a lack of 
radiation safety culture, sites have been left contami¬ 
nated with residues even from authorized operations. 
Such sites include factories where radium was used in 
luminescent paint and thorium was used in thorium- 
coated gas mantles [4]. 

Existing Nuclear Activities and Current Sources 
of Radioactive Contamination 

Due to an improved radiation safety culture over the 
last several decades, the discharge of radionuclides into 
the environment has greatly declined in many countries 
[5]. Despite low discharges of radionuclides, nuclear 
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fuel cycle facilities still contribute to contamination of 
the environment (mainly within the facility) to 
a sufficient extent that remedial actions might be advis¬ 
able even in the absence of radiation accidents. Natu¬ 
rally occurring radioactive materials (NORMs) such as 
238 U, 232 Th, and their progeny are often the most sig¬ 
nificant environmental contaminants at such sites. 
NORMs can also occur in many ores containing metals 
such as copper, gold, coal, and monazite. Residues from 
the production of phosphoric acid and various phos¬ 
phates may also cause contamination of the environ¬ 
ment [27]. The combustion of coal for electricity 
generation, smelting processes, and heating leads to 
the release and dispersion of some natural radionu¬ 
clides, mainly radium, in the environment. 

Normally, remediation of all such sites is an oblig¬ 
atory part of the decommissioning process and is sub¬ 
ject to special compliance monitoring [17, 28] which 
must prove that affected areas have been adequately 
remediated before it is permissible for them to be used 
for other purposes. 

Currently, radionuclides are extensively used for 
a variety of scientific, medical, agricultural, and indus¬ 
trial purposes [2]. The major users of unsealed radio¬ 
active sources are hospitals where nuclear practices are 
used for diagnostic and therapeutic purposes, and to 
a lesser extent for treatments. Some of the radionu¬ 
clides used for such activities ( 3 H, 14 C, 32 P, 35 S, and 
13: 1) occur in the liquid effluents which can be 
discharged into the environment [6]. These radionu¬ 
clides are also in use in a range of research applications 
such as genetic studies, animal health or environmental 
studies, while 147 Pm is widely used in industry [4]. 

An additional long-term source of contamination 
of the environment is radioactive waste. Environmental 
contamination may occur during the collection, 
processing, storage, and disposal of wastes associated 
with the use of radioactive isotopes for industrial, 
research, and medical applications [4, 6]. 

International Frameworks for Application 
of Countermeasures and Remediation 
of Contaminated Areas 

The major source of international recommendation on 
this subject is the International Commission on Radio¬ 
logical Protection (ICRP). The ICRP is the primary 


international body that recommends the fundamental 
principles and quantitative basis upon which appropri¬ 
ate radiation protection measures can be established. 
It leaves the responsibility of formulating the specific 
advice, codes of practice, or regulations tailored to the 
individual countries (http://www.icrp.org/about.asp). 

Overall, the ICRP’s recommendations serve as 
a major basis for all protective actions, including envi¬ 
ronmental countermeasures and any remedial actions. 
International standards and supporting technical doc¬ 
uments, such as the International Basic Safety Stan¬ 
dards for Protection Against Ionizing Radiation and 
for the Safety of Radiation Sources [29], are also 
based on ICRP recommendations. 

The ICRP recently revised its approach to charac¬ 
terizing situations in which exposure to humans 
may occur. Instead of a process-based approach [30] 
distinguished by its use of two concepts, practices and 
interventions , the ICRP suggested a situation-based 
approach to characterize exposure situations [31]. 
This approach replaces reference to practices with 
planned exposure situations and replaces interventions 
with emergency exposure situations and existing exposure 
situations [9]. At present, the ICRP considers three 
major exposure situations [31]: 

1. Planned exposure situations are situations involving 
the deliberate introduction and operation of radio¬ 
active sources. 

2. Emergency exposure situations are situations that 
may occur during the operation of a planned situ¬ 
ation, or from a malicious act, or from any other 
unexpected situation, and require urgent action to 
avoid or reduce undesirable consequences. 

3. Existing exposure situations are situations that 
already exist when a decision on control has to be 
taken, including prolonged exposure situations 
following emergencies. 

Overall, radiation protection is based on three fun¬ 
damental principles, namely, justification , optimization , 
and individual dose limitation. Within the context of 
remediation, the fundamental principles can be formu¬ 
lated as justification of remediation, optimization of 
remedial actions, and restriction of residual individual 
doses [9] . Together, they provide a basic framework for 
the application of remediation in long-term exposure 
situations. 
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Any remediation should be justified and remedial 
measures shall do more good than harm and shall pro¬ 
vide optimized arrangements to maximize the net ben¬ 
efit to society [ 1] . Thus, the total benefit of reduction of 
exposure through remediation must be greater than 
any detriment that the remediation may cause. Overall, 
remediation shall [1]: 

(a) Reduce the doses to individuals or groups of indi¬ 
viduals being exposed 

(b) Avert doses to individuals or groups of individuals 
that are likely to arise in the future 

(c) Prevent or reduce environmental impacts from the 
radionuclides present in the contaminated area 

Justification of remediation shall go far beyond the 
scope of radiological protection and can include eco¬ 
nomic, political, environmental, social, and psychologi¬ 
cal facets. Justification of remediation should be based 
on cumulative benefits and impacts of individual reme¬ 
dial actions composing the remediation strategy. Some 
individually justified actions may be available, but may 
not provide a net benefit when considered as an overall 
strategy because, for example, collectively they bring too 
much social disruption for the considered exposed pop¬ 
ulation as a whole, or they are too complex to manage. 
Conversely, a single protective action may not be justi¬ 
fied alone, but may contribute to an overall net benefit 
when included as part of a protection strategy [9]. 

The responsibility for ensuring an overall benefit to 
society as well as to individuals when the population is 
allowed to continue living in contaminated territories 
is with governments or national authorities. At the 
same time, the current approach also emphasizes an 
important role to the individuals concerned. In existing 
exposure situations, justification should be considered 
for all remedial actions that may be included in 
a remediation strategy: those implemented centrally 
and locally by authorities, experts, and professionals 
and those directly implemented by the exposed indi¬ 
viduals as self-help protective actions with the support 
of the authorities. The remediation strategy defined 
by the authorities should also take into account the 
ability of affected individuals to take self-help initia¬ 
tives. However, as self-help protective actions are 
implemented - and thus largely decided - by the inhab¬ 
itants themselves, they must be properly informed and 
even trained (to use the means and equipments 


provided by the authorities) to take decisions 
concerning their own protection with a net benefit. 

The process of optimization of protection is 
intended for application to those situations for which 
the implementation of protection strategies has been 
justified. Thus, if justified, remediation of contami¬ 
nated area should be optimized. The level of protection 
to be achieved by the remediation should be the best 
under the prevailing circumstances, maximizing the 
margin of benefit over harm. 

Owing to its judgmental nature, there is a strong 
need for transparency of the optimization process. The 
decision-making process shall provide for the involve¬ 
ment of a wide range of interested parties in the 
definition, implementation, and verification of reme¬ 
diation programs and for regular public information 
exchange on the implementation of these programs 
[1]. All the data, parameters, assumptions, and the 
values that enter into the process should be presented 
and defined very clearly. This transparency assumes 
that all relevant information is provided to the involved 
parties, and that the traceability of the decision-making 
process is documented properly, aiming for an 
informed decision [31]. 

The optimization principle should be applied in the 
same manner in all exposure situations. Restrictions 
are applied with respect to the permitted dose to 
a nominal individual (or representative person). The 
representative person is an individual, who will almost 
always be a hypothetical construct, receives a dose that 
is representative of the more highly exposed individuals 
in the population [32]. The representative person is 
equivalent to, and replaces, the average member of the 
critical group recommended previously by the ICRP 
[33-35]. 

Options which result in the representative person 
receiving a greater dose than the restrictions permit 
should be rejected at the planning stage. Importantly, 
these dose restrictions are applied prospectively, as 
with optimization as a whole. If, following the 
implementation of an optimized protection strategy, 
it is subsequently shown that the permitted dose for the 
representative person has been exceeded, the reasons 
for this should be investigated but the exceeding alone 
should not necessarily encourage remedial action. 

To avoid inadequate outputs of the optimization 
procedure, restrictions should be placed on the 
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potential doses or risks to individuals who remain in 
contaminated territory or implement remedial actions. 

The main factors to be considered for setting the 
reference levels for existing exposure situations are the 
feasibility of controlling the situation, and the past 
experience with the management of similar situations. 
In most existing exposure situations, there is a desire 
from the exposed individual as well as from the author¬ 
ities to reduce exposures to levels that are close or 
similar to situations considered as “normal.” This 
applies particularly to situations of exposures from 
material resulting from human actions. 

The experience gained in the aftermath of the Cher¬ 
nobyl accident has shown that nontechnical factors can 
affect remediation policies enormously [36]. The ICRP 
recommendations are based on objective assessments 
of the health risks associated with exposure levels and 
on radiological protection attributes of various expo¬ 
sure situations. However, members of the public (and 
sometimes their political representatives) may have 
personal views on radiation risks and feasibility of 
remedial actions intended for dose reductions. This 
can lead to different perceived needs for intervention 
and the extent of remediation, depending on the origin 
of the exposure. The public concerns are normally 
stronger when the environment is contaminated by 
artificial radionuclides than when these are of natural 
origin. An elevated exposure due to natural radiation 
sources are often ignored by the public, while relatively 
minor exposures to artificial long-lived radioactive res¬ 
idues are often a source of concern and sometimes lead 
to a request in remediation even it is not justifiable in 
terms of radiological protection standards. 

As the long-term objective for existing exposure 
situations is to reduce exposures to levels that are close 
or similar to situations considered as normal , the ICRP 
recommends selecting the reference level for the opti¬ 
mization of protection of individuals living in contam¬ 
inated territories in the lower part of the 1-20 mSv/year 
band [31]. 

The reference level represents the level of dose or 
risk above which it is judged to be inappropriate to plan 
to allow exposures to occur, and below which optimi¬ 
zation of protection is implemented. Optimization is to 
be applied in emergency and existing exposure situa¬ 
tions, even if the initial doses are below the defined 
reference levels. 


Past experience has demonstrated that typical 
values used for constraining the optimization process 
in long-term postaccident situations fall in the range of 
the dose limit for planned exposure situations, namely, 
around 1 mSv/year [9]. National authority may take 
into account the prevailing circumstances and also 
usefully take account of the timing of the overall reha¬ 
bilitation program to adopt intermediate reference 
levels to progressively improve the situation. 

Although the above considerations were sometimes 
referred to the chronic exposures situation after the 
radiation accidents, the above framework is also 
entirely applicable for the sites of former practices 
and other similar exposure situations. Thus, if expo¬ 
sure level at the site of a former practice satisfies 
the reference level for all its future plausible uses, the 
site may be released for unrestricted use and the 
decommissioning phase of the practice can be termi¬ 
nated. If this is not feasible, the site may still be released 
for restricted use or released for unrestricted use after 
remediation. However, these restrictions and remedial 
actions (if any) can be considered a type of intervention 
since some form of institutional control will be still 
required [9]. 

Site Evaluation for Remediation Purposes 

Remediation of any contaminated site starts with a site 
evaluation, which includes a comprehensive analysis 
of the factors that could affect the need of such actions 
and effectiveness of their application [37]. Overall, site 
evaluation should assist in making three major deci¬ 
sions. First, site evaluation should confirm that the 
site is contaminated to an extent requiring remedia¬ 
tion. Second, it should be able to suggest appropriate 
management options if compliance with radiation 
safety criteria was not confirmed. And third, a site 
evaluation also includes a demonstration of remedia¬ 
tion success and justification to release a site for future 
use [17, 28]. 

A site evaluation includes investigations covering 
[17]: 

• Properties of both contaminants and the 
environment 

• Detailed consideration of factors that could affect 
radiation safety at a site or of an activity that could 
result in a release of radioactive material and/or 
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could affect the dispersion of such material in the 
environment 

• Population and access issues relevant to safety (e.g., 
feasibility of evacuation, location of people and 
resources) 

To achieve these tasks, information on the site 
under evaluation should be made available and include 
data that is reliable and sufficient for making proper 
decisions. 

If, after discovery of contamination, a radiological 
scan of a site shows high levels of radioactivity in some 
areas, then a grid survey using properly calibrated 
equipment is carried out. The property boundaries 
and pathways of the natural drainage system should 
be given special attention. This survey should also 
identify whether dose limits for members of the public 
or workers using a site might be exceeded under some 
circumstances within the boundaries of the site and to 
what extent [28]. 

If the need for application of countermeasures in the 
acute stage of the emergency or remediation was con¬ 
firmed, a detailed site inspection needs to be undertaken. 
The aim of this survey is to determine the spatial extent 
of the problem, and to gather sufficient information to 
allow a full dose assessment for potential site occupiers 
and off-site populations. Any radiological threats to the 
environment from the existing situation, or from possi¬ 
ble remediation actions, also have to be identified. 

To achieve these objectives, an optimized sampling 
(measurement) program is required. The development 
of such program commences with the determination of 
baselines and comprises of a review of the history of all 
nuclear activities at the site. Alternatively, the presence of 
contamination has to be identified by measurements 
and expert judgment if no other information is available. 

If a consideration of the results of this characteri¬ 
zation leads to a decision to undertake protective 
actions, then detailed investigations are necessary to 
enable an optimized remediation strategy to be 
constructed. In addition, characterization of any 
waste stream may be necessary before the transport 
and disposal options for the waste are approved. 

One final stage is required in the case of remedia¬ 
tion, namely, a final survey (a compliance monitoring) 
to demonstrate that the objectives of remediation 
were successfully achieved. It also ensures that 


(1) radionuclide dispersion and migration will not 
have deleterious effects on the population and (2) the 
environment and the site can be reused following reme¬ 
diation. At that stage, compliance of residual concen¬ 
trations of radionuclides in the environment with the 
acceptable levels must be verified and any areas with 
residual contamination must be identified and the 
nature, quantity, and distribution of the radioactivity 
determined. An example of the procedure, demonstrat¬ 
ing interactions between a site remediation and evalu¬ 
ation within one generalized process is outlined in 
Fig. 1. 

Management Options for the Application 
of Countermeasures and Remediation for 
Different Environments 

The main objective of a management option as 
a countermeasure or as part of a remediation strategy 
is to reduce or prevent doses to humans. Although 
inhalation of resuspended material is a potential path¬ 
way, the doses in many situations arise largely from 
either the consumption of contaminated foodstuffs 
and water via ingestion or from exposure to contami¬ 
nated surfaces in an inhabited area via external irradi¬ 
ation. A secondary objective is to provide reassurance 
to consumers and people living in contaminated areas 
and thereby maintain public confidence. 

Countermeasures can be implemented at different 
phases of an emergency response extending from before 
the radioactive contamination is actually deposited and 
continuing for a varying amount of time depending on 
the characteristics of the accident or incident. The focus in 
the discussion here is on management options used after 
the acute phase of an accident and during remediation 
although countermeasures are mentioned to some extent. 

The environmental behavior and mobility of radio¬ 
active contamination, and thus the potential severity of 
an emergency or existing exposure situation in terms of 
the predicted dose received by humans depends greatly 
on (1) the composition and physicochemical form of 
the radionuclides present and (2) the characteristics 
of the ecosystem affected by the contamination and 
how it is used by humans [38-43] . Thus, many different 
factors influence the selection of which management 
options are most appropriate to apply and when. When 
developing a remediation strategy for the long-term 
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Environmental Countermeasures and Restoration. Figure 1 

Key stages in remediation and release of contaminated sites (Adopted from [4]) 
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recovery of contaminated areas, some of the most 
important factors to consider are described below. 

Factors and Processes Influencing Management 
Options 

Management options are largely designed to reduce 
(1) the ingestion dose from the consumption of con¬ 
taminated foodstuffs and drinking water, (2) external 
dose rates from deposited radionuclides, and (3) inha¬ 
lation dose from resuspended material. A wide range of 
factors can influence the administration, impact, and 
consequence of the use of various management 
options. Some of these factors are considered briefly 
below and others in more detail in subsequent sections. 

Temporal Factors These include the time of year that 
the contamination happened, the variation in activity 
concentrations of the deposited radionuclides with 
time and their rate of movement in the environment. 
In the short term, prompt countermeasures may be 
necessary such as housing dairy cattle if they are grazing 
outdoors at the time of deposition. However, many 
management options take time to organize and imple¬ 
ment. Where management options involve significant 
and/or irreversible actions, then stakeholder consulta¬ 
tion is advisable if time permits, but this may often only 
be possible in the recovery phase of an accident when 
defining a remedial strategy. 

Spatial Factors The size of the area affected, extent of 
contamination present, and land use will influence the 
choice of appropriate management options can be used. 
In heavily contaminated areas, priority has initially to be 
given to the direct protection of the population rather 
than options for agricultural production. In other less 
badly affected areas, restrictions on the entry of food into 
the food chain may be required or management options 
to reduce radioactive contamination in food products. 
Finally, adjacently, relatively unaffected areas will also 
need to be monitored in case of indirect effects and to 
maintain public confidence. The transfer of radionu¬ 
clides may vary with factors such as soil type. In reme¬ 
diation situations, this may mean that less heavily 
contaminated areas outside the major zone of concern 
with sensitive soil types allowing higher radionuclide 
uptake into plants are actually producing more highly 


contaminated food products than those areas which 
received higher deposition [44-46]. 

Effectiveness of Management Options Information 
on effectiveness is generally presented as a percentage 
reduction in activity concentration in the target 
medium (i.e., soils, crops, building surfaces, roads) 
after implementation. There are some management 
options (such as live monitoring) which do not directly 
reduce doses but increase the effectiveness of other 
options and may also provide reassurance. Effective¬ 
ness is influenced by many factors including when it is 
applied relative to the time of deposition, application 
rates, duration of treatments, the radionuclide depos¬ 
ited and its form, biological half-live in animals, effec¬ 
tive half-life in different ecosystems, timeliness of 
implementation, compliance, and acceptability. 

Radiological Impact (Protection of Implementers) 

A radiological impact can occur due to incremental 
doses received by people who implement management 
options during procedures that are not part of normal 
practices. The magnitude of the doses received will 
depend on many variables such as the type of radionu¬ 
clide released and what becomes contaminated, the expo¬ 
sure pathways (external, inhalation, ingestion, irradiation 
of the skin) resulting from carrying out the management 
options (including disposal of wastes) and length of 
exposure time. Doses to implementers are controllable 
so the ICRP principles of protection apply, namely, 
justification, optimization and, dose/risk limitation. 

Economic Cost Many factors influence the monetary 
cost of implementing management options [47] . Direct 
costs which are more obvious are the cost of imple¬ 
mentation, labor, consumables, equipment and trans¬ 
port, and the cost of handling wastes and carrying out 
environmental remediation. But there are other less 
often considered costs such as those incurred through 
loss of production, loss of retail sales, closure of busi¬ 
nesses, and the provision of temporary premises. Indi¬ 
rect costs are often equally important, although they 
are inherently more difficult to quantify. Examples are 
loss of market share, regional impact on tourism, which 
could extend to area adjacent to those which are con¬ 
taminated, and the impact on society when people 
cannot return to normal lifestyles. 




Environmental Countermeasures and Restoration 


E 


3525 


Importantly, the option not to carry out any man¬ 
agement options can also be a valid alternative. If 
authorities decide that countermeasures or remedia¬ 
tion is not required, then it is important that this 
decision is accompanied by a monitoring strategy 
aimed at reassuring the local population. This option 
should be considered if measurements and assessments 
indicate that the doses to people living in the area 
would be low. No judgment is made here on what 
would constitute a low dose. Other factors could influ¬ 
ence the decision not to implement any remediation, 
such as availability of limited resources or a very large 
area being affected. 

A large number of management options have been 
developed and applied, especially since the Chernobyl 
accident. In the 1990s, initial efforts were made in the 
STRATEGY project to synthesize the information on 
available management options for the early to late phase 
after an accident, and to critically evaluate whether they 
were generally applicable and useful to areas other than 
those in which they had been applied [48] . The manage¬ 
ment options can be subdivided according to the dif¬ 
ferent categories discussed below; some management 
options in one category are also applicable to others. 

The most important generally applicable manage¬ 
ment option is the imposition of food bans to prevent 
contaminated food products from entering the food 
chain. This is a highly effective, but expensive option 
and would be an automatically imposed countermea¬ 
sure due to regulatory procedures after accidents if 
contamination levels were predicted to exceed the rel¬ 
evant action levels. Alternatively, it may be considered 
appropriate in certain circumstances to raise action 
levels. For example, this was done in Scandinavia after 
the Chernobyl accident to protect the Saami way of life. 

Countermeasure and Remedial Actions for 
Agricultural Food Chains 


concerns, but also a wide range of other issues such as 
the local economy, societal, and ethical concerns, and 
disposal of wastes. 

In the pre-deposition phase, responses will obvi¬ 
ously be dependent on the extent and quality of infor¬ 
mation available. Potential countermeasures include 
the prevention of contamination reaching food prod¬ 
ucts such as switching off ventilation or irrigation 
systems in greenhouses, etc., or covering foodstuffs 
to prevent surface contamination. Livestock can 
also be sheltered from the contamination by being 
housed. 

Agricultural food chain management options in 
the early to late phase can be divided into two types, 
namely, intervention along the soil-crop pathway and 
in animal production systems. The former focuses on 
reducing the contamination of crops including cereals, 
industrial crops, vegetables, and horticultural crops, 
food plants for domestic animals, and fruit. It is there¬ 
fore largely aimed at reducing dose to humans. 
A management option which applies to both groups 
is the selection of alternative land uses whereby con¬ 
taminated land may be used for nonfood production, 
such as flax, bioenergy, or biofuel crops, or for the 
production of leather and wool. Land may also be 
used for forestry, but this is unlikely to be the preferred 
option. The various available management options 
have one or more of the following aims: 

• To remove contamination completely, usually 
through topsoil removal 

• To reduce soil-to-plant transfer of radionuclides 
through a variety of techniques including deep, 
shallow, or skim and burial plowing, and soil treat¬ 
ment to reduce radionuclide transfer from soil to 
plants 

• To reduce radionuclide contents at the food 
processing stage through chemical procedures or 
dilution. 


E 


The agricultural food chain supplies the majority of 
food to humans. Therefore the application of manage¬ 
ment options to this exposure pathway is a critical part 
of any emergency response or remediation strategy. In 
food production systems, management options are 
targeted to various media and contamination pathways 
including soil, crops, livestock, and other animal prod¬ 
ucts. They are not only aimed at addressing health 


If fodder plants are too highly contaminated for 
direct human consumption, they can still be fed to 
agricultural animals in the early stages of growth as 
long as there is sufficient time for decontamination 
thereafter, prior to slaughter. 

Intervention in animal production systems is 
largely aimed at reducing the ingestion dose from 
meat, milk, and other dairy products, offal, eggs, and 
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other foodstuffs derived from agricultural animals 
[45, 49-52]. They generally involve: 

• Decontamination of animals by feeding the animals 
uncontaminated feeds; the efficiency of “clean 
feeding” depends on the biological half-life of each 
radionuclide in the muscle or other products 

• Reduce ingestion of contaminated feed by selective 
grazing regimes 

• Reduce gut uptake of radionuclides by the addition 
of (1) specific radionuclide binders to the animal 
feed such as ammonium hexacyanoferrate for 
radiocaesium and (2) stable analogs such as Ca to 
suppress radiostrontium absorption 

• Slaughtering of dairy livestock, which may be nec¬ 
essary if uncontaminated feed is not available (or 
other countermeasures are not possible); lactation 
might be suppressed in conjunction with this 
option 

• Manipulating slaughter times to minimize the 
activity concentrations, by allowing physical decay 
of short-lived radionuclides and allowing longer 
finishing periods (sometimes with clean feed) to 
ensure that the meat is below action levels 

• Reduce radionuclide contents at the food 
processing stage 

Many of the above management options can be 
combined with live monitoring to determine whether 
activity concentration in animals are below the action 
limits and/or optimize application of the management 
options. 

Countermeasures and Remedial Actions for 
Domestic and Free Foods from Extensive 
Ecosystems Including Forests 

Extensive ecosystems include unimproved meadows 
(such as in mountainous areas or forest clearings) 
used for grazing livestock as well as forested areas. For 
grazing animals, some of the management options 
listed above may be applicable, although some animals 
in these areas are handled infrequently, so many of the 
most effective measures such as clean feeding are diffi¬ 
cult to implement. 

Forested areas are used by humans both for forestry, 
recreation, and as sources of wild foods. Dose pathways 
for forests may include external exposure from the 


forest floor, contamination in trees, handling of for¬ 
estry material, and industrial production based on 
contaminated wood. Internal exposure arises from 
consumption of forest foods such as mushrooms, 
nuts, and berries. Finally inhalation of radionuclides 
can occur following forest fires or combustion of con¬ 
taminated wood [53]. 

The reduction of some of the doses occurring in 
forests soon after deposition of radionuclides requires 
measures aimed at the atmospheric deposition process 
itself and are difficult to implement, so restricting access 
to the contaminated forest areas maybe the most impor¬ 
tant short-term countermeasure. Over longer time 
periods, other available management options include 
selective harvesting to avoid the most contaminated 
wild foods, preventing forest fires, providing advice on 
use of wood ash and reducing soil-plant uptake. 

Countermeasures and Remedial Actions for 
Aquatic Ecosystems 

Aquatic ecosystems include lakes, rivers, groundwater, 
and marine waters, each of which has contrasting 
characteristics due to hydrological, hydrochemical, 
or morphological factors. The need for application of 
management options for aquatic ecosystems is, there¬ 
fore, largely dependent on site-specific parameters which 
often severely constrain what can be done. Furthermore, 
management option application in contaminated waters 
may be expensive, including large engineering costs so 
cost-benefit analysis is particularly important. 

In general, population doses from aquatic pathways 
are often less than from terrestrial pathways depending 
on food habits. The main dose pathways from aquatic 
sources are from their use as drinking water supplies 
and for irrigation, and as a source of aquatic foodstuffs. 
If lakes used in power production, which may alter lake 
volume, are contaminated then the occasional expo¬ 
sure of contaminated bottom sediments can lead to the 
risk of contaminated sediments becoming airborne. 
Due to the self-shielding of water, external doses from 
recreational use of contaminated lakes and rivers can 
often be relatively low. 

Intervention in aquatic systems involves reducing: 

• Contamination reaching the water body 

• Uptake by fish 

• Ingestion of contaminated feed 
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Management options for drinking water are 
targeted at public and private water supplies and can 
be divided into two main groups: 

• Provide clean water through alternative supplies 

and changes to the source of the supply 

• Treatment of water (blending and/or purification) 

Many of the above groups of management options 
are more effective if combined with dietary advice, the 
provision of monitoring kits, food labeling, and com¬ 
pensation schemes. 

Countermeasures and Remedial Actions for 
Inhabited Areas 

The urban ecosystem is where most people live and 
constitutes a wide variety of different structures and 
natural surfaces such as grassed areas, paved areas, 
exterior walls, roof paving, trees, shrubs, and interior 
surfaces that can all become contaminated and contrib¬ 
ute to dose. The relative magnitude of the dose contri¬ 
bution from these different structures and surfaces, 
and thus the specific needs for management options 
depends strongly on the release scenario. Clean-up may 
result in large volumes of contaminated material 
requiring disposal. 

For contaminated surfaces there is a need to ade¬ 
quately identify and quantify the problem, including 
the type and spatial distribution of the radioactive 
contamination. This is because inadequate and incom¬ 
plete application of management options which do not 
take account of the overall spatial and vertical distribu¬ 
tion of radioactive contamination can be much less 
effective than expected when correctly applied [54]. 

Although most remediation techniques for 
inhabited areas are designed to reduce external dose 
[55, 56], non-acute inhalation doses received from 
resuspended contamination can be substantial. To 
reduce this pathway, techniques to fix contamination 
to the surface to suppress dust can be used in a later 
recovery phase [57]. 

Management options for inhabited areas comprise 
two main groups that (1) shield people from the 
contamination (shielding options) and (2) remove 
contamination (removal options also called 
decontamination or clean-up options). They may be 
further classified according to the type of target surface 


(i.e., buildings, road and paved areas, soils and grass, 
trees and shrubs). 

Shielding options can be used to reduce both exter¬ 
nal exposure and the intake of contaminated material, 
and are effective in providing protection against 
both exposure pathways [58]. The effectiveness of 
a shielding option is defined as the reduction in the 
external dose rate from a surface (e.g., buildings, paved 
surfaces, grass, soil, and shrubs), generally expressed 
as a percentage, after the implementation of the option. 
If the primary aim is to reduce external exposure, 
shielding materials can be placed between the contam¬ 
ination and people (burial and covering of objects). 
The use of shielding materials is potentially a particu¬ 
larly effective option for short-lived radionuclides 
emitting alpha or beta radiation. Some more perma¬ 
nent shielding options, such as burial of contaminated 
material or the permanent relocation of people from 
a contaminated area are also effective for long-lived 
radionuclides and gamma-emitting radionuclides. 
There are three main types of shielding options: 

- Burial and covering of objects including the use of 
clean topsoil or sand in gardens and other open 
areas and the turning of paving slabs, triple digging, 
and plowing in park areas 

- Restricting people from gaining access to contami¬ 
nated areas or objects, or relocating people from 
the area 

- Fixing of contamination to the surface and 
restricting its mobility 

Removal options involve the decontamination or 
clean-up of contaminated surfaces and objects. The 
effectiveness of a removal option is defined as the 
ratio of the activity initially present on a specific surface 
(e.g., buildings, paved surfaces, grass, soil, and shrubs) 
to that remaining after implementing the option. This 
ratio is usually called the decontamination factor (DF). 

Surface treatments for buildings and infrastructure 
include road planning, vacuum sweeping of roads 
and walkways, high-pressure water hosing of walls 
and/or roofs, sandblasting of walls, ammonium treat¬ 
ment of walls, roof cleaning by cleaning device, inten¬ 
sive indoor surface cleaning, and firehosing of roads 
and walkways. One of the main disadvantages of these 
management options is that contaminated waste mate¬ 
rial is often produced in large quantities. Also, 
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decontamination techniques such as high-pressure 
hosing and sandblasting may damage old or poorly 
maintained brick or stone buildings. 

Treatment of the gardens and parks can also be 
considered and available management options include 
snow removal, topsoil removal, lawn mowing and 
pruning, or removal of trees and shrubs. 

Waste Disposal Options 

Waste disposal options have been summarized most 
recently by Nisbetetal. [58, 59] in Recovery Handbooks 
and are summarized briefly here. For food production 
systems, considerable amounts of biodegradable waste 
can arise if food bans are implemented. Since the 
restrictions are based on statutory requirements they 
have to be carried out at short notice and appropriate 
routes of disposal need to be identified in advance of 
future accidents or incidents. 

However, several management options also pro¬ 
duce contaminated by-products and routes for their 
disposal must be considered. Contaminated produce 
that might require disposal include crops and by¬ 
products from processing of grass products (fresh 
grass, silage, hay), milk and by-products from 
processing, whole animal carcasses and meat, and soil. 

Waste treatment options that can be carried out 
on-site include composting, land spreading of milk 
and/or slurry, and plowing in of a standing crop. 
Off-site options include biological treatment (diges¬ 
tion) of milk, burial of carcasses, disposal of contami¬ 
nated milk to sea, incineration of crops, landfill, 
processing and storage of milk products for disposal, 
and rendering of animal carcasses (which converts 
waste animal tissue into stable, value-added materials). 

The contamination of an inhabited area generates 
both solid and liquid waste regardless of whether any 
countermeasures are applied. In inhabited areas, con¬ 
taminated items may need to be removed and facilities 
such as those for water treatment may become con¬ 
taminated by waste water from management options 
such as firehosing buildings and by contaminated pre¬ 
cipitation and runoff. The categories of radioactive 
waste are (1) contaminated waste (refuse) and goods, 
(2) solid and liquid waste from clean-up of the con¬ 
taminated area and (3) waste water from precipitation 
and natural run-off. It is therefore important to 


consider the impact of the contaminated waste on the 
public, workers handling the waste, the environment, 
and normal waste disposal practices. 

Waste disposal schemes for solid contaminated 
waste should preferably be planned for in advance, 
with potential disposal sites and resources identified 
if possible [54]. The selection of suitable disposal 
sites needs to take account of the proximity, cost, 
hydrogeological characteristics of the site, geological 
stability, and future land use. 

Construction of a repository (Fig. 2) should con¬ 
sider a number of other factors: 

- There should be adequate shielding so that the 
contamination in the repository does not contrib¬ 
ute significantly to the external dose rate in the area. 

- The ecological characteristics of the area should be 
assessed and intrusion by humans, animals, and 
plant roots needs to be prevented, usually by using 
suitable protective layers. 

- Migration of radionuclides from the waste with 
rainwater must be prevented, usually by draining 
the rainwater off in a gravel layer on top of the waste 
layer. 

- The repository should be protected against erosion. 

- The acceptability of constructing repositories needs 
to be assessed. 

- The repository may be equipped with a drainpipe at 
the bottom for inspection of the radionuclide con¬ 
tents in any water passing through the construction. 

- The design should be simple, inexpensive, and pos¬ 
sible to construct quickly. 

Secondary (or Side) Effects of the Application 
of Management Options 

Environmental impacts occurring due to the applica¬ 
tion of management options can be positive or nega¬ 
tive, direct or indirect, and can lead to social disruption 
or damage to the environment [60]. Direct environ¬ 
mental impacts can include changes in biodiversity, soil 
fertility and structure, and enhanced soil erosion as well 
as pollution of air and aquatic ecosystems. Indirect 
effects may be less immediately obvious, but could 
include restrictions to an individual ability to follow 
their selected lifestyle. 

The Chernobyl Forum, based on the related studies 
in areas affected by the Chernobyl accident, reported 
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Environmental Countermeasures and Restoration. Figure 2 

The principles of a simple ground repository for radioactively contaminated remediation waste (e.g., soil), as 
recommended by Junker et al. [53] 


that some management options led to detrimental 
social and economic effects, particularly a breakdown 
in the social infrastructure [61] which could also be 
considered amongst the most serious consequences of 
the accident. Nevertheless, remediation can also 
include options to restore ecosystems, and secure liveli¬ 
hoods and the social structure of affected populations, 
or stabilize the economic situation [62]. 

Side or secondary effects of management options 
can be very diverse. If all access by the public is 
restricted and access for forestry is limited, major losses 
to the population would include a lack of locally col¬ 
lectible: wood for industrial purposes and for heating; 
berries and mushrooms; forest fodder for dairy cows 
and firewood for rural populations. The cost of forestry 
is also increased due to limitations on the time which 
forest workers can spend in contaminated forests 
which, in turn, can lead to deterioration in the main¬ 
tenance of forests with negative ecological conse¬ 
quences. Application of restrictive management 
options such as limited access to forest in the areas 
affected by the Chernobyl accident had negative psy¬ 
chological and sociological consequences. The eco¬ 
nomic losses were also large [60, 62]. 

Implementation of soil-based agricultural actions 
can change soil properties or decrease air and water 


quality, while land use-based remedial options 
can change the landscape and its use, impacting 
the economic or recreation leisure value of an area 
[12, 48, 60]. 

Societal and Ethical Aspects 

Overall, the primary factors and effects described above 
led to many ethical issues including assessment of the 
consequences of remedial actions implementation for 
future generations, sustainability, and comparing harm 
to the environment to benefits to humans. When both 
radiological factors and social aspects are taken into 
account, better acceptability of countermeasures by the 
public can be achieved [12, 61]. Thus, the impact on 
individual and communities needs to be considered. 
Local stakeholders need to be consulted in the identi¬ 
fication of problems and their solution. 

Some management options may have a negative 
impact on society by causing disruption (e.g., through 
restricting access or activities), anxiety, and stress (e.g., by 
causing panic, upheaval), and stigma (e.g., by affecting 
businesses or tourism). These need to be balanced against 
positive impacts through provision of reassurance and 
improvements to living conditions. There are also ethical 
issues to be addressed when selecting management 
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options, including: provision of self-help options that 
reinforce liberty and dignity, the distribution of doses 
over space and time and between different members of 
the community, animal welfare, environmental risk, and 
consequences for future generations [62]. 

The ethical acceptability of remediation is highly 
dependent on the ecological status of the area and the 
degree to which the actions diverge from usual practice. 
In most cases, environmental legislation must also be 
considered [63]. 

Studies on effectiveness of remediation implementa¬ 
tion in many areas affected by the Chernobyl accident 
have identified a variety of factors that had a highly 
detrimental impact on the economic and social activities 
on contaminated areas. Practically all remedial options 
were too expensive to be competitive at the market, and 
required large subsidies from the state to support eco¬ 
nomic activity in the affected regions. In most cases, 
remedial actions imposed a stigma on the areas with 
contaminated environments [62]. Similar psychological 
effects have also been observed in some other European 
countries [64]. This constrained or prevented export of 
even high-quality products to nonaffected areas, 
constrained economic and social development of 
those regains, and promoted migration of some of the 
working-age population to areas with lower contami¬ 
nation levels. As a consequence, economies and social 
structures in affected communities deteriorated, along¬ 
side an apparent increase in poverty [62]. 

The provision of information, and how that infor¬ 
mation is communicated, will have a significant influ¬ 
ence on how the authorities manage the situation, on 
society’s response to the problem and on the overall 
success of the recovery strategy. Maintaining public 
confidence is paramount. Trust is easily lost and diffi¬ 
cult to develop and/or regain so it is important to create 
a framework for information and communication 
strategies under noncrisis conditions with the involve¬ 
ment of stakeholders. The type of information dissem¬ 
inated should be targeted to meet a variety of needs. 
The form of communication should be adapted to 
different levels of understanding, circumstances and 
to address the relevant issues. This should be 
implemented at the same time as the development of 
restoration strategies. 

Management option implementation is generally 
the responsibility of the authorities and carried out 


by designated personnel. Nevertheless, self-help 
implemented by the affected population can also be 
considered. Some of the simple management options 
that do not require any specific skills or experience to 
implement (such as lawn mowing, digging) are poten¬ 
tially suitable for implementation by affected inhabi¬ 
tants themselves [65]. This could, for instance, be 
beneficial in providing an extra labor force in situations 
where large areas need to be treated quickly. The 
involvement of affected persons in actions to improve 
their own situation can be psychologically beneficial 
and give a better feeling of control of the situation, 
which also prevents undue anxiety. Health and safety 
issues need to be carefully considered for people carry¬ 
ing out such “self-help” options, need careful commu¬ 
nication and to be conducted with adequate 
supervision of individuals to ensure they are 
implemented correctly. 

The Compendium of Management Options 
Datasheets 

The concept of systematically describing all the key 
factors which need to be considered in the application 
of management options in a datasheet format was 
developed in the mid-1990s within the STRATEGY 
project [41, 48, 55, 66]. Standardized datasheets, 
mainly based on the Chernobyl experience, capturing 
key information required to carry out the management 
options and evaluate their feasibility, applicability, and 
practicality were developed to aid the decision-making 
process and to ensure that potentially important issues 
are not overlooked. They also described the resources 
needed. The information in the datasheets was 
intended to be generally applicable for use in different 
European countries. The rural and waste templates are 
being used by the joint FAO/IAEA division to be 
adapted to other climate types and to update the 
IAEA Handbook of parameter values for radionuclide 
transfer in the temperate environments 364 [67]. 

Stakeholder participation is an important mecha¬ 
nism to explore these additional benefits or disadvan¬ 
tages to the use of management options, and is an 
essential step in developing a decision framework 
which avoids problems previously experienced in 
emergency management [68]. Close liaison with the 
FARMING stakeholder network facilitated evaluation 



Environmental Countermeasures and Restoration 


3531 



of management options datasheets for rural ecosys¬ 
tems; while for other datasheets, stakeholders 
were consulted in small groups or individually. 
Stakeholder opinion suggested that some manage¬ 
ment options were as likely to be rejected on socio- 
ethical grounds as on technical and economic 
grounds. Rejection of specific management options 
can be expected to show site, context, and national 
differences. 

The datasheets developed in STRATEGY have been 
gradually improved during the EURANOS project [69] 
with more detail provided and the scope widened to 
provide information representing more diverse con¬ 
ditions and for wider range of radiological hazards. 
Sources of contamination considered in the hand¬ 
books include nuclear accidents and radiological dis¬ 
persion devices and the most relevant radionuclides 
are included. In addition, non-radiological consider¬ 
ations have been incorporated in the later generations 
of datasheets [48, 57], introducing viewpoints from 
experts on social, ethical, and economical sciences 
[68], and also considering documented reactions of 
the public and other participants during management 
option trials in the Former Soviet Union. 

The most recent compendium of datasheets is inte¬ 
grated into decision-aiding handbooks for contami¬ 
nated food production areas [58, 59] and inhabited 
areas [57]. These handbooks contain advice on the 
development of optimized strategies for remediation 
under specific conditions using a series of diagrams, 
tables, and decision-charts, which guide the user 
through a series of eliminations of unsuitable options. 
Site-specific factors need to be taken into account in 
this process, and it is recommended in the handbooks 
that they be customized and tailored for different users. 
The importance of adequately involving people in deci¬ 
sions that affect them is stressed. 

The most recent revision of the datasheet template 
for food [58] and inhabited areas [55, 57] includes 
a wide range of criteria describing different aspects of 
each countermeasure including the objective and ben¬ 
efits, the target of the measure and the radionuclides 
targeted, constraints (e.g., legal and social), effective¬ 
ness, feasibility (equipment, skills, infrastructure 
needed), waste produced, doses to implements, costs, 
side effects of implementation, and stakeholder opin¬ 
ion of the measure. 


The datasheets provide a comprehensive overview 
of management options [69] which are intended to be 
useful to “middle-level” decision makers who have to 
assess possible countermeasure options. They are not 
a comprehensive set of instructions, but provide 
a template to adapt to local conditions/languages. 
Their applicability will reduce across the world as cli¬ 
mate, culture, production systems, and drinking water 
supply deviate from those currently encountered in 
Europe. 

Countermeasures and Remediation in Areas 
Affected by Radiation Accidents 

Various radiation accidents requiring an organized 
emergency response have resulted from a variety of 
different factors, including human error (Chernobyl 
accident), violations in procedures at a radioactive 
waste tank (Kyshtym accident), the crash of air¬ 
crafts and subsequent loss of nuclear weapons 
(e.g., Palamares, Spain and Thule, Greenland) and the 
loss or theft of sealed radioactive sources used for 
medical purposes (Goiania, Brazil). Contamination 
scenarios and the radionuclides released differed in 
each case, thus requiring a variety of different counter¬ 
measures and remediation strategies. 

Kyshtym Accident 

An accident at a nuclear facility producing weapon 
grade plutonium in Chelyabinsk-40 (Soviet Union) 
led to one of the largest radioactive releases into the 
environment. Sr-90 was the most important dose 
forming radionuclide in the release and required exten¬ 
sive countermeasures and remediation. The resulting 
East Urals Radioactive Trace (EURT) was defined as the 
area where the 90 Sr contamination density at the time 
of the accident was above 3.7 kBqm -2 , a value which 
was twice that of global fallout of 90 Sr from nuclear 
tests. In the EURT area (23,000 km 2 ) there were 217 
settlements with 270,000 inhabitants [70-74]. Within 
the EURT, a contamination density of 74 kBqm -2 for 
90 Sr was used as the criterion to identify areas requiring 
management options [74]. Heavily contaminated areas 
were in a narrow band, 4.5-6 km wide and 105 km long 
covering an area of 1,000 km 2 (c. 5% of the EURT area). 
All economic activity within the contamination iso- 
lines of 74-150 kBqm -2 of 90 Sr was suspended 
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(resulting in a restriction zone of 700 km 2 ). In 1959, 
this area included 106,000 ha of which agricultural 
lands, forests, and lakes accounted for 54%, 36%, and 
9%, respectively [70]. 

Agriculture The Kyshtym accident was the first case 
where vast areas received countermeasures soon after 
the accident and remediation activities in the long 
term. In the first phase, the major contributor to the 
dose was the internal exposure from 144 Ce uptake by 
humans with foodstuffs [73]. The maximum concen¬ 
tration of radionuclides in agricultural products on 
land closest to source areas (up to 20 km) reached 
10-10,000 kBqkg -1 , 144 Ce and 95 Zr being the main 
contributors to these values except for milk where 
90 Sr comprised a mean of 70% of the activity concen¬ 
tration present [25]. The radionuclide concentration 
in all environmental media, including farm products, 
declined with time reflecting the ecological half- 
life of the contaminated territory. After 5-8 years 
(after the 95 Zr, 106 Ru, and 144 Ce had largely decayed 
away), only 90 Sr, and to a very small extent 137 Cs, were 
relevant for determining radiation dose to humans 
[26]. In response, soil-based management options 
were a focus of the remediation strategy adopted 
(Table 1). 

For some small areas with very high contamination 
levels, the decontamination of land by removal of the 
upper soil layer (5-10 cm depth) using bulldozers, 
graders, or scrapers with subsequent burial in special 
facilities was very effective (Table 2). This action was 
widely employed despite the high operational cost. 
Plowing rapidly after deposition greatly reduced exter¬ 
nal dose rate. Plowing of soil with the burial of the most 
contaminated upper layer (deep plowing) soon after 
the accident was one of the most effective methods that 
brought about a threefold reduction in external doses 
[74]. Plowing and burial were especially effective on 
chernozem soils with a thick humus horizon (up to 
0.5 m); on low-fertility soils it was less effective. The 
decontamination effect from deep plowing was 
enhanced by adding mineral fertilizers at balanced 
rates (according to plant demand for mineral nutri¬ 
ents) and liming where appropriate. These widely used 
techniques provided up to a tenfold reduction in 90 Sr 
concentrations in plants compared with conventional 
plowing [26, 74]. Normal plowing (20-25 cm of soil) 
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Table 1 Effectiveness of soil-based options for different 
crop types [24-26] 



a H g Hydrolytic acidity 
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Table 2 Summary of reduction factors of soil based 
options used in the former Soviet Union countries after 
the Chernobyl accident [16, 49, 52] 



Reduction factor 

Countermeasure 

137 Cs 

90 Sr 

Normal plowing (first year) 

2.5-4.0 


Skim and burial plowing 

8-16 


Liming 

1.5-3.0 

1.5-2.6 

Mineral fertilizers 

1.5-3.0 

0.8-2.0 

Organic fertilizers 

1.5-2.0 

1.2-1.5 

Radical improvement (first 
application) 

2.0-9.0 

(2.0-3.0) 3 

1.5-3.5 
(1.5-2.0) 

Removal of soil (external 
dose) 

1.5 

Not 

relevant 


a Multiple applications 
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for the first time after deposition achieved a 1.1-2.4- 
fold decrease in dose rate of y-radiation. A further 
reduction in 90 Sr uptake in plants (including fodder 
for animals) was achieved through cultivation of plants 
with low 90 Sr accumulation giving a roughly tenfold 
reduction in the main farm crops although the reduc¬ 
tion for some species was up to 60-fold [26, 74]. 

Management options to reduce 90 Sr accumulation 
in animal products mainly consisted of the provision of 
fodder with low amounts of 90 Sr to animals (clean 
feeding). The most effective feeds were based on pota¬ 
toes, root vegetables, and grain which had much lower 
90 Sr concentrations compared with feeds from unim¬ 
proved lands typical of preaccident farming practice. 
In specialized agricultural farms this clean feeding 
resulted in a decrease of 90 Sr content in meat and 
milk by two- to sevenfold and three- to fourfold, 
respectively compared with normal farms [74, 75]. 

A reduction in 90 Sr transfer to animal products was 
also achieved through the administration of Ca addi¬ 
tives or fodder rich in Ca (e.g., legume crops) resulting 
in a tenfold decline in 90 Sr concentrations in milk. Sr- 
90 reduction in meat was also achieved by preslaughter 
feeding with less contaminated fodder [26, 74]. 

Further reductions in the 90 Sr concentrations in 
food products and, hence, the internal doses to 
humans, was achieved by processing raw products. 
The processing of milk to fermented milk products 
and butter, which reduces 90 Sr concentration in butter 
20-fold compared with raw milk was widely used. Con¬ 
ventional processing of grain reduces 90 Sr concentra¬ 
tion in the final products by two to three times. Starch, 
alcohol, and vegetable oil were produced “clean” from 
the original contaminated agricultural products after 
the normally used processing [26, 74, 75]. 

Forest and Aquatic Ecosystems The key feature in 
the application of environmental management options 
in natural ecosystems affected by the accident was the 
identification and implementation of action limits for 
forest products including berries, mushrooms, game, 
and wood. The maximum permissible 90 Sr contamina¬ 
tion density for forested areas used for pasture and 
haymaking was set at 92 kBqm -2 and for the produc¬ 
tion of industrial wood at 3.7 MBqm -2 [26, 74]. Suc¬ 
cessful implementation of these options was achieved 
by transferring the priority right to use these lands to 


specialized farms, and controlling public access to 
those areas. 

Initial restrictions on the economic use of water 
bodies (fishing, use of aquatic vegetation as fodder for 
farm animals) were imposed on lakes in areas with 90 Sr 
density above 74 kBqm -2 [71]. By 1970, reutilization 
of lakes located in the affected area was possible for fish 
farming due to natural decontamination in lakes (the 
90 Sr ecological half-life in water was 5-6 years) 
(Alexakhin et al. 2004 [25]). The remediation strategy 
was based on the presumption that lands should not 
be left vacant (with the caveat that occupational stan¬ 
dards and dose limits to individuals must be observed) 
and their exploitation must partially compensate for 
damage from radioactive contamination. It was based 
on two main options: land use change and develop¬ 
ment of new economic activities, dependent on safe 
use of the environment. Land use differentiation into 
agriculture, forestry, and other branches of the rural 
economy using soil, plant, and water resources was 
based on the ability to produce products with permis¬ 
sible levels of 90 Sr contamination. Since farming was 
seriously restricted in the affected region (due to 
exceeding action levels), economic activities were 
focused on the production of industrial (nonfood) 
goods, development of forestry and local industry and 
the exploitation of peat, sand, gravel, and other mineral 
resources [26]. 

Effect of Remediation As a result of the remediation, 
a substantial decrease in the effective annual doses to 
the local population was achieved which ensured that 
these doses fell below those set out in the radiation 
safety standards present in the USSR at that time. The 
complex mixture of protective actions which was ini¬ 
tially developed for areas affected by the Kyshtym acci¬ 
dent was subsequently adopted for areas affected by the 
Chernobyl accident, with appropriate modifications to 
take account of the different circumstances [25, 70, 71] . 

Nuclear Weapon Accidents: Palomares Accident 

Although a number of accidents involving nuclear 
weapons are currently known to have occurred (with 
different extent of reliability) (see http://www. 
atomicarchive. com/ Almanac/B rokenarrows. shtml) , a 
limited number, in particular accidents with losses of 
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nuclear weapons from aircraft, required remediation 
efforts. 

In January 1966, near the south-eastern coast of 
Spain, two American air force aircrafts collided during 
air refueling. Of the four nuclear bombs that were 
onboard the aircraft, one was found in the Mediterra¬ 
nean Sea. The second bomb fell by parachute onto 
agricultural fields. The remaining two bombs were 
physically destroyed due to the explosion of their 
chemical detonators. Pyrophoric metallic plutonium 
caught fire and the resulting radioactive cloud, 
containing plutonium oxide, moved westward con¬ 
taminating an area of 262 ha within a region of inten¬ 
sive farming, occupied by rural dwellings, agricultural 
fields, and forested areas [19]. 

The main remedial management options were 
applied to the soil. The soil was removed, placed into 
containers, and shipped to the US for disposal at the 
Savannah River plant at a Pu deposition density above 
1.2 MBqm -2 . The total area of decontaminated soil 
was 2.2 ha, removed in 6,000 containers with a volume 
of 250 L each. Arable soil with Pu concentrations under 
1.2 MBqm -2 (17 ha) was re-plowed to a depth of 
30 cm. In rocky areas where plowing was impossible 
and Pu levels exceeded 120 kBqm 2 , soil was removed by 
hand and shipped to the USA [19]. 

Upon completion of mechanical decontamination, 
radiation monitoring of the air, soil, plants, farm ani¬ 
mals, and the population was carried out. The major 
farm crops produced in the affected area were toma¬ 
toes, barley, alfalfa, maize, pepper, and fruit such as 
melon. Contamination levels in agricultural products 
were not sufficient to cause serious exposures to the 
public. Twenty-three years after the accident approxi¬ 
mately 99% of the Pu still remains in the 0-5 cm surface 
soil layer [26]. 

The accident produced no serious health impacts, 
since doses were very low. The committed effective doses 
calculated for total Pu intake over 30 years amounted to 
37 and 210 pSv for urban and agricultural workers (the 
most exposed population group), respectively [76]. 

Chernobyl Accident 

The accident at the Chernobyl NPP was the most serious 
radiation accident in the history of nuclear energy gen¬ 
eration. The accidental release comprised some highly 


environmentally mobile radionuclides including iso¬ 
topes of radio iodine, radiostrontium, and radiocaesium, 
which had a serious impact on agriculture [12]. 

More than 15,000 settlements with a population of 
around six million people were living in areas with 
a 137 Cs deposition density above 37 kBqm -2 , and 
were therefore officially designated as contaminated. 
In the areas most severely affected, with 137 Cs contam¬ 
ination densities exceeding 555 kBq m -2 , there were 
640 settlements, with 270,000 inhabitants [61, 77, 78]. 

Soon after the accident, a 30-km radius exclusion 
zone (the CEZ) was established around the reactor. 
Further population relocations took place in subse¬ 
quent months and years in Belarus, the Russian Fed¬ 
eration and Ukraine. Eventually, 116,000 persons were 
evacuated or relocated [61] with about 50,000 cattle, 
13,000 pigs, 3,300 sheep, and 700 horses [12]. In the 
30-km zone, more than 20,000 agricultural and domes¬ 
tic animals, including cats and dogs, were slaughtered 
and buried. In the first few years after the accident 
comprehensive surveys of contaminated lands and 
on-site inspection of foodstuffs were carried out. In 
1989-1991, people from 470 settlements outside of 
the 30-km zone of the Chernobyl NPP were moved 
together with their agricultural animals to new loca¬ 
tions [61]. 

Although decontamination of some settlements 
was carried out within the first few years after 
the accident, the main challenge for remediation after 
the accident was to retain the capability for agricultural 
production in contaminated areas. This task was of 
paramount importance to preserve the economy 
and the social and psychological stability of the 
population [12]. 

Agriculture The maximum radionuclide concentra¬ 
tions in agricultural products occurred in the farms 
surrounding the Chernobyl NPP in the first few days 
and weeks after the accident and considerably exceeded 
action levels [12]. During the first few weeks after the 
accident 13: 1 was the main contributor to internal dose 
leading to significant radiation doses through milk 
consumption [61]. In addition, plants and animals 
were also contaminated with a wide range of other 
radionuclides, in particular, radiocaesium isotopes 
which were the dominant contaminant in most envi¬ 
ronmental samples and food products from June 1986. 
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In the first weeks after the accident, the main goal of 
countermeasures was to prevent the consumption of 
contaminated milk and green vegetables by the popu¬ 
lation, and to decrease 131 1 concentrations in milk [16]. 
This was largely achieved by (1) radiation monitoring 
of agricultural products and the rejection of foodstuffs 
in which 13: 1 activity concentrations were above action 
levels (3,700 Bql -1 at that time) and (2), processing 
rejected milk (mainly converting milk to storable prod¬ 
ucts such as condensed or dried milk, cheese, or but¬ 
ter). Information on countermeasures was confined to 
managers and local authorities, and was not distributed 
in time to private farmers. As a result, application of the 
countermeasures was often delayed in rural settlements 
for privately produced milk, resulting in low counter¬ 
measure effectiveness in some areas. 

The rural population around the ChNPP was the 
most heavily affected by both external and internal 
exposure pathways. The production of foodstuffs with 
radionuclide activity concentrations above action levels 
was considered by the population as evidence that the 
land and their lifestyle had been severely blighted. 
Therefore, the rapid deployment of optimized coun¬ 
termeasures was important not only to reduce the 
radiation doses but also for the social and psychological 
well being of the population of affected regions and 
helping to decrease their stress levels. 

In the least contaminated zone (37-185 kBqm -2 ) 
farming continued based on agricultural practices suit¬ 
able for the soil and climate, except for areas with peaty 
soils, with high radiocaesium uptake by plants, where 
radical improvement was implemented. In the 185— 
555 kBqm -2 zone, plant production on arable soils 
continued without restrictions. Soils received increased 
doses of P and K or organic fertilizers. Natural 
meadows on sandy, sandy-loam, and peaty soils were 
subjected to radical improvement (which includes 
removing vegetation, plowing, liming, fertilization, 
and plants reseeding) with a 1.5-fold increase in the 
application rate of potassium and phosphorous fertil¬ 
izers [12, 79] (Fig. 3). In the 555-1,480 kBqm -2 zone, 
there was a full-scale remediation. The areas receiving 
soil based management options in the 20 years follow¬ 
ing the Chernobyl accident are shown in Fig. 4 and the 
effectiveness of these actions is given in Table 2. 

The provision of feed with low contamination levels 
to previously contaminated animals for an appropriate 



Environmental Countermeasures and Restoration. 
Figure 3 

Implementation of soil based remedial actions (radical 
improvement) in Rivno region of Ukraine [upper photo) 
and Tula region of Russian Federation [photo in the mid). 
Lower picture shows a meadow after radical improvement 
(Bryansk region, Russia). (Courtesy of Russian Institute of 
Agricultural Radiology and Agroecology Obninsk, Russia and 
Ukrainian Institute of Agricultural Radiology Kiev, Ukraine) 
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Environmental Countermeasures and Restoration. Figure 4 

Changes with time in the extent of agricultural areas treated with liming (a) and mineral fertilizers (b) as a countermeasure 
in the USSR/fSU countries contaminated by the Chernobyl accident [12] 


period before slaughter effectively reduced contamina¬ 
tion in meat. Such “clean feeding” was normally com¬ 
bined with live monitoring of animals so that if 
radiocaesium transferred to meat was above action 
levels, the animals were returned to the farm for further 
clean feeding (Fig. 5). 

Another important countermeasure used since the 
beginning of the 1990 s to reduce radiocaesium con¬ 
tamination in animal products was administration of 
cesium binders (Fig. 6). The Cs binder used was 
a hexacyanoferrate, widely referred to as Prussian blue 
[80] and was administered as a powder and as rumen 
dwelling boli. 

This application has been especially useful and 
effective in settlements where the grazed meadows 
were not suitable for radical improvement. Various 
methods for milk processing to butter and sour-milk 
products were also developed and successfully 
implemented during food processing. 

Forest and Aquatic Ecosystems More than 
40,000 km 2 of forested areas had 137 Cs concentrations 
above 37 kBqm -2 and in Russia alone about 50,000 
forestry workers and members of their families lived in 
these affected areas [25]. 

In many of the most affected areas, the role of the 
forest as a source of food is essential to local 


communities [81]. Natural reductions in the 137 Cs 
concentrations in “forest products” was much slower 
compared with agricultural products; the effective half- 
lives for 137 Cs in mushrooms and berries were roughly 
20 and 10 years, respectively [82, 83] . As a consequence, 
in the long term after the accident “forest products” 
(especially mushrooms) sometimes contributed more 
to the exposure of a population than did critical food¬ 
stuffs such as milk. This was particularly prevalent in 
settlements near forests [14, 39, 81]. 

Remedial actions in forests were mainly of 
a restrictive nature and the dose reduction achieved 
by various management options was small [84, 85]. 
Thus, in the affected forested areas, bans were imposed 
on the collection of mushrooms, berries, hunting 
game, and private cattle grazing on forest pastures 
and clearings; fire-control actions were also 
implemented. Management options in contaminated 
woods (such as the collection and removal of forest 
litter and leaves or reforestation) were either techno¬ 
logically ineffective, or economically expensive and 
ecologically unjustified. 

The Chernobyl accident also affected many water 
bodies. Some 8 million people were exposed to addi¬ 
tional radiation due to the consumption of drinking 
water and 32 million due to the consumption of fish 
and the use of water for irrigation purposes [61, 86]. 
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Environmental Countermeasures and Restoration. 
Figure 5 

Live monitoring measurements of animals in the 
settlements of the Bryansk region Russia affected by the 
ChNPP accident (Courtesy of Russian Institute of 
Agricultural Radiology and Agroecology, Obninsk, Russia) 


Two major remediation strategies were 
implemented in water bodies affected by the Chernobyl 
accident: the implementation of remedial options to 
restrict radionuclide distribution with water move¬ 
ments; and restrictions on the consumption of aquatic 
foodstuffs and drinking water from the most contam¬ 
inated water bodies. In the contaminated catchment 
area different hydro-technical constructions were 
implemented (dikes, dams, etc., as well as special chan¬ 
nels). Also, to limit radionuclide migration and accu¬ 
mulation in fish, various sorbents were added to induce 
radionuclide sedimentation from water masses, and 
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Figure 6 

Administration of the hexacyanoferrate boli to 
a sheep (Courtesy of Centre for Ecology & Hydrology, 
Lancaster Environment Centre, Lancaster, UK) 

potassium fertilizers were added to reduce radionuclide 
accumulation in fish [87, 88] (see example in Fig. 7). 
These measures were rather costly and their effective¬ 
ness was generally low, especially compared with agri¬ 
cultural remedial options [26, 61]. 

Urban Ecosystems The decontamination of settle¬ 
ments following the Chernobyl accident was applied 
at a limited scale. For example, the Soviet army in 1989 
decontaminated 93 rural settlements affected by the 
Chernobyl accident in the Bryansk region. The army 
units removed a layer of topsoil from areas around 
houses, from children gardens, and other public sites 
[54]. However, the dose reducing effect was only 1.1- 
1.5; which was much lower than expected [14]. There¬ 
fore, decontamination of inhabited areas ceased and 
relocation of the residents was continued in some cases. 
In later studies, carried out in 1997 it was shown that if 
soil removal is optimized in relation to the distribution 
of the contaminants in soil profile (e.g., using a mini¬ 
bulldozer, see Fig. 8), the external dose rate can be 
reduced by at least a factor of 6 [54, 89]. 

Effect of Remediation Countermeasures applied in 
the early phase of the Chernobyl accident had 
a considerable effect in averting doses to the affected 
population. However, they were only partially effective 
in reducing radioiodine intake via milk, because of 
the lack of timely information about the accident and 


E 





3538 


E 


Environmental Countermeasures and Restoration 



Environmental Countermeasures and Restoration. Figure 7 

Application of potassium chloride fertilizer for reduction of 137 Cs concentration in the fish of the Svaytoe Lake (Belarus) 
(left photo) and the lake after the treatment (right photo). The fertilizers were colored red with a special neutral 
additive to track a potassium chloride application 



The contribution of internal exposure to radiation 
doses of the general public caused by the Chernobyl 
radioactive release has been significant both in the 
short and long term. Agricultural management options 
were the most efficient in reducing dose, averting 
30-40% of the collective internal dose to the affected 
population, excluding thyroid dose [12]. 

The current slow, natural decrease of 137 Cs avail¬ 
ability in soil means that remediation actions after the 
Chernobyl accident will be important for around 
70 years after the accident except for those areas (such 
as the 30-km zone around Chernobyl NPP) which have 
been withdrawn from economic use and for which this 
time period is longer. 
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Figure 8 

Removal of contaminated topsoil with a "Bobcat" mini¬ 
bulldozer in the Bryansk region in 1997 [54] 


advice on appropriate actions, particularly for private 
rural farmers [61]. 

Large-scale application of a range of management 
options achieved a sharp decrease in the amount of 
animal produce with radiocaesium activity concentra¬ 
tions above action levels in all three countries (Fig. 9). 


Goiania Accident 

A serious radiation accident occurred in September 
1987 in Goiania in Brazil. One of the hospitals in 
Goiania possessed a 50.9 TBq (1,375 Ci) 137 Cs 
( 137 CsCl) sealed source for therapeutic applications 
[2, 3, 90]. Several unemployed young people pene¬ 
trated the unguarded building and stole the device. 
They thought that the 50-kg metallic object was made 
from a valuable material so they separated the source 
from its biological shield and opened the source con¬ 
tainer. The source was sold at a junk yard; the junkyard 
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Environmental Countermeasures and Restoration. Figure 9 

Variation with time in amounts of milk and meat exceeding action levels in Russia (all milk and meat - collective 
and private), Ukraine and Belarus (only milk and meat entering processing plants) after the Chernobyl accident, tones [12] 


owner considered the material to be precious because 
of its blue luminescence. Dispersion of the radioactive 
substance then occurred as people came to look at the 
material, some of whom then took away pieces of it. 
Some people took parts of the source as souvenirs; 
others painted their faces with the blue powder for 
carnival. 

As a result, the source was broken into small frag¬ 
ments and widely distributed. Radioactive substances 
were dispersed by high winds over a variety of vegetable 
gardens, houses, and roofs of buildings. Radioactive 
material even reached the main river in the region, 
the Meia-Ponte River with 137 Cs found in bottom sed¬ 
iments 12 km from the source site. In this incident 
some 250 persons were exposed to external and internal 
irradiation; 50 of which developed symptoms of whole- 
body and acute local irradiation; 14 of these had 
marrow failure. Doses of internal exposure in affected 
people ranged from 0.046 to 0.97 Gy [3]. 

Remediation actions for the Goiania accident were 
carried out with international assistance. Some 755 
professionals took part in remediation of the contam¬ 
inated sites and were exposed to differing extents. 
Doses to around 70% of the participants were below 
1 mSv and the maximum dose value reached was 
16 mSv [91]. Very high contamination was found in 
85 houses and residents from 41 houses were 


evacuated. The total area subject to radiation monitor¬ 
ing within the urban territory covered 67 km 2 . In the 
city a y-survey was performed by helicopter, followed 
by another y-survey performed by vehicle. 

The limits of contamination density chosen for the 
remediation actions in Goiania (3.7 Bqcm -2 or 37 kBq 
m -2 ) were the same as for the classification of the 
Chernobyl (1986)-affected area (areas with 137 Cs levels 
37 kBqm -2 or 1 Cikm -2 ). Once contaminated sites 
had been identified, more labor-intensive options 
were implemented. Various chemical solvents and 
abrasives were used for mechanical decontamination 
and radionuclide-containing materials were physically 
removed [92]. 

The identification of potential locations to store the 
collected radioactive materials was met with a highly 
negative public response. Ultimately, a site for tempo¬ 
rary storage was chosen in the thinly populated town of 
Abadia de Goids (20 km outside the city of Goiania). In 
the most affected areas houses were destroyed, soil was 
removed, and the decontaminated site was covered 
with concrete. In less-affected buildings, walls, floors, 
and roofs were decontaminated. Personal items that 
contained radioactive substances were designated as 
waste, since people refused to use them even after 
decontamination. In addition to houses, 45 public 
places including parts of streets, squares, parks, and 
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shops were decontaminated. By the end of December 
1987, the main affected sites were decontaminated. 
Remediation after the Goiania incident resulted in the 
bulk of the 137 Cs source (50.9 TBq) being localized in 
the 1.6-km 2 waste disposal area. The remediation in 
Goiania thus largely comprised of the mechanical 
cleanup of contaminated territories, collection of vari¬ 
ous contaminated materials which were later assigned 
to radioactive waste, and their disposal at specially 
allocated storage sites [93]. 

Remediation of Sites Affected by Nuclear Testing 

Radioactive contamination is often the result of human 
actions, the consequences of which were not fully 
understood at the time, as is the case for nuclear testing 
sites. Nuclear tests were conducted at a total of 16 sites, 
the most well known of which are the Nevada Test Site 
(the USA), Bikini and Enewetak (Marshall Islands), 
Emu, Maralinga, and Monte Bello Islands (Australia), 
Mururoa and Fangataufa (French Polynesia), Semipa- 
latinsk (Kazakhstan), Novaya Zemlya (Russia), and 
Lop Nor (China) [6, 78, 94-97]. In total, about 29 
mountains of fission yield were associated with radio¬ 
nuclides locally deposited at the test sites [98] . Another 
source of local contamination at the test sites were 
safety tests, in which nuclear weapons were destroyed 
using conventional explosives to simulate a possible 
accident. 

Only two of these sites (Enewetak and Maralinga) 
have been remediated. Remediation at the other test 
sites was either not carried out or only partly 
performed, either because of low residual contamina¬ 
tion (fractions of long-lived radionuclides in many 
types of nuclear explosions are very low), high financial 
costs, the remoteness of the site from populated areas 
or other political reasons. Even when a full remediation 
strategy was not applied at these sites, some remedial 
actions were usually conducted, especially where con¬ 
siderable plutonium residues resulting from safety tests 
were present [96]. 

Bikini Atoll Test Site 

Nuclear tests were conducted at Bikini Atoll from 1946 
to 1958. The regional fallout from the Bravo test caused 
widespread contamination of the Bikini Atoll and 
forced the evacuation of Marshallese people living on 


Rongelap and Utrik Atolls [96]. About 50% of the 
fission yield associated with near-surface nuclear deto¬ 
nations was deposited on a local or regional scale [99]. 

The 167 inhabitants of Bikini Island were evacuated 
prior to the first nuclear test in 1946. In August 1968, 
resettlement of the Bikinian people on the atoll was 
allowed and about 100 people returned to the atoll. 
Because of concerns expressed by Bikini residents 
about the safety of living on the atoll, in 1975-1978 
a set of additional investigations were carried out gath¬ 
ering data for more precise dose estimates [100]. In 
1978, a tenfold increase in the body content of 137 Cs 
compared to those measured in 1970 was identified, 
based on whole body measurements [101] which were 
associated with consumption of coconut fluid due to 
the limited availability of drinking water. In August and 
September 1978, in response to the uptake of cesium in 
the population, the Bikinians who had returned to 
Bikini Atoll were relocated once again [96]. 

The total annual dose to the potential residents of 
the atoll (above the natural background dose) was 
estimated in 1999 to be 4.0-15 mSv, depending on 
the ratio of local to imported foods in the diets of 
inhabitants. Cs-137 was identified as the main radio¬ 
nuclide contributing to the dose, while contributions of 
90 Sr, 239+240 Pu, and 241 Am were much lower. The intake 
of marine foods, stored rain water and groundwater, 
and inhalation of resuspended soil together accounted 
for less than 1% of the dose [102]. 

Although full-scale remedial actions at the site have 
not been applied, a few remedial options for Bikini 
were implemented and studied in detail. The interven¬ 
tions were based on projected dose to a hypothetical 
critical group. The assessed dose of 15 mSv [103] was 
higher than the action dose level for intervention of 
10 mSv accepted at that time [29]. Therefore, the 
implementation of remediation was considered to be 
highly justified. The key remedial options used were 
(1) washing-out 137 Cs from the soil with large quanti¬ 
ties of salt water, (2) application of ameliorants such as 
zeolites to soil to make 137 Cs less available for plant 
uptake, (3) planting vegetation with high biomass to 
remove 137 Cs from the soil and then burial of the 
contaminated biomass, (4) soil removal (40 cm top 
layer), and (5) treating soil with potassium fertilizers [96] . 

Soil removal and treatment with potassium fertil¬ 
izers were identified as the most effective measures. 
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However, soil removal would necessitate moving about 
30,000 mature coconuts, pandanus and papaya trees, 
and breadfruit and was therefore a complex and 
extremely expensive operation. In contrast, soil 
removal from dwelling areas would be more cost effec¬ 
tive and the best option was to remove soil from 
around each house and replace it with a layer of 
crushed coral to minimize external exposure and pos¬ 
sible ingestion of the remaining soil. Consideration was 
also given to the potential ecological effects of the 
proposed potassium fertilizer treatment; at the planned 
levels there was no significant alteration expected in the 
soil chemistry and the potential transfer of potassium 
to groundwater was very low [104]. Application of 
potassium fertilizer would be needed every 4 or 5 
years to maintain the low 137 Cs activity concentrations 
in local foods. It has been estimated that after the 
potassium treatment of soil and the replacement of 
soil from around the living areas, the dose rate based 
on the assumption that residents were consuming only 
local food would be reduced by as much as a factor of 
10 or slightly higher compared with the levels before 
remedial actions application [105]. 

Maralinga Test Site 

Tests of the UK nuclear program at the Maralinga test 
site located on the Nullarbor Plain in South Australia 
began in 1955. The site was abandoned in 1967 follow¬ 
ing the advent of the Nuclear Non-proliferation Treaty 
and the banning of atmospheric nuclear tests because 
the site was not suitable for underground tests. Seven 
nuclear tests were performed at the site, with approx¬ 
imate yields ranging from 1-27 kt. The site was also 
used for hundreds of minor trials from 1955 to May 
1963, many of which were safety tests intended to 
investigate the effects of fire or nonnuclear explosions 
on atomic weapons [106]. 

Since 1967, numerous surveys have been carried out 
to characterize the contamination within the site. Sub¬ 
stantial contamination densities were found at many test 
sites, particularly at Taranaki site where the Vixen 
B nuclear test was performed. Several square kilometers 
of land were contaminated to levels exceeding 300 kBq 
m 2 of 239 Pu. Some limited areas at the site were 10- or 
100-fold more contaminated. Initially, plutonium con¬ 
tamination consisted of the following isotopes: 238 Pu, 


239 Pu, 240 Pu, and 241 Pu; radioactive decay resulted in 
241 Pu being gradually replaced with 241 Am [96]. 

The first efforts to remediate the Maralinga site 
began in 1964. Initially, contaminated debris and 
most of the remaining infrastructure were buried in 
pits at a depth of 2-3 m. The Pu content of each of these 
pits was not well quantified, but was estimated to be in 
kilograms quantities [106]. In 1967, further steps were 
taken to reduce the plutonium surface concentrations 
by turning over and mixing the surface soil. Following 
the remediation the site was closed with a series of 
fences erected to enclose the burial pits containing 
significant quantities of plutonium at Taranaki and 
other sites [96]. 

Later assessments made for nomadic Aborigines 
living mainly an outdoor lifestyle identified the inha¬ 
lation dose pathway as the contributor to the total 
doses to both adults and children [107]. As a result of 
this survey, a much more extensive remediation project 
was initiated at the site, within which three sets of 
criteria were established for making decisions on the 
remediation process, namely: 

1. Contaminated soil was removed where the levels of 
dispersed 241 Am exceeded 40 kBqm -2 averaged 
over 1 ha, or where contaminated particles exceed¬ 
ing 100 kBq were found, or where the density of 
particles exceeding 20 kBq was greater than 1 in 
10 m 2 (Soil-Removal Criteria). 

2. Once soil was removed the residual levels of dis¬ 
persed contamination in the cleared area was not to 
exceed 3 kBqm -2 for 241 Am, averaged over 1 ha, 
and particulate contamination was to meet the 
Soil-Removal Criteria (Clearance Criteria). 

3. Permanent occupancy and unrestricted land-use 
was only to occur where levels of dispersed contam¬ 
ination were less than 3 kBqkm -2 for 241 Am, aver¬ 
aged over 3 km 2 , and the particulate contamination 
met the Soil-Removal Criteria (Unrestricted Use 
Criteria) [108-110]. 

To reduce the inhalation risk, the remediation 
criteria were guided by conservative principles and 
estimation of doses by realistic scenarios. These 
included the possibility of an Aboriginal group living 
for an entire year on the edge of the nonresidential area 
in regions of the highest activity permitted (approxi¬ 
mately 3 kBqm -2 of 241 Am). 


E 




3542 


E 


Environmental Countermeasures and Restoration 


The first stage of the remediation project consisted 
of defining the boundaries at the sites contaminated 
with Pu (Taranaki, TMs, and Wewak), followed by bulk 
removal of contaminated soil from the three sites and 
burial in excavated burial trenches under at least 5 m of 
clean rock and soil. At the end of the remediation 
process, a compliance assessment was carried out to 
ensure that the whole Maralinga area had been made 
safe following the work undertaken during the 1994- 
2000 Maralinga project [96]. 

Following the remediation by removal and burial 
at depth of contaminated surface soil, all areas at 
Maralinga were shown to be within acceptable 
absorbed dose limits for all envisaged land uses. The 
restriction on permanent occupancy within the 
“restricted land-use” (nonresidential) boundary sur¬ 
rounding Taranaki was set as a precautionary measure 
as the inhalation doses for permanent occupancy of all 
but a few areas (essentially within the untreated 
plumes) were well below the 1 mSv a -1 limit for mem¬ 
bers of the public [96]. 

Remediation of Sites within Decommissioning 
of Facilities 

The use of the term decommissioning implies that no 
further use of the facility (or part thereof) for its existing 
purpose is foreseen [111]. Decommissioning actions are 
taken at the end of the operating lifetime of a facility to 
remove it from service with due regard for the health and 
safety of workers and members of the public and the 
protection of the environment. Subject to national legal 
and regulatory requirements, a facility or its remaining 
parts may also be considered decommissioned if it is 
incorporated into a new or existing facility, or even if the 
site on which it is located is still under regulatory control 
or institutional control [112]. 

The actions taken during decommissioning will 
need to be such as to ensure the long protection of 
the public and the environment, and typically include 
reducing the levels of residual radionuclides in the 
materials and the site of the facility so that the materials 
can be safely recycled, reused, or disposed of as exempt 
waste or as radioactive waste, and the site can be 
released for unrestricted use or otherwise reused [111]. 

Remediation can be applied to both nuclear and 
nonnuclear contaminants that can affect human health 


and the environment. Remediation encompasses inter 
alia site characterization, identification of remedial 
action alternatives, implementation of a remedial 
action, and ongoing monitoring to assure the confine¬ 
ment or containment of residual contamination. 

There are many facilities with different characteris¬ 
tics around the world contaminated with radionuclides 
and therefore, according to the legislation, require 
decommissioning. They can include nuclear power 
reactors, radiochemical processing facilities, uranium 
and thorium mines, research institutions and hospitals, 
and many other human activities [28]. Therefore, 
remediation activities can be also very different and 
vary from decontamination of selected pieces of 
equipment to large-scale soil removal near some 
uranium-mining facilities or demolition of large con¬ 
crete structures and placing them into final or interim 
waste storage [111]. 

In contrast, nuclear laboratories or institutions 
with a good standard of radiation protection can be 
decommissioned after limited remediation efforts 
although in all cases, the decommissioning process 
must be well planned and sufficient resources must be 
available. Over time, without proper arrangements 
being made for decommissioning, shut down facilities 
deteriorate and ultimately constitute a radiological 
hazard in their vicinity from direct external exposure 
to radiation to the public and potential release of 
radioactive material to the environment [112]. Cur¬ 
rently, a substantial discharge of radionuclides has 
only been observed within a few decommissioning 
projects, largely at uranium-mining and uranium¬ 
milling facilities [113]. 

Remediation goals are usually based on dose con¬ 
straints and derived action levels expressed in concen¬ 
trations of radionuclides in various media. Because of 
the high cost, remediation is more often aimed at 
achieving “restricted reuse” or “fit-for-purpose 
reuse” as opposed to “unrestricted reuse,” though the 
latter would be the ideal goal. The remedial objectives 
for any restricted reuse are then based on meeting 
appropriate dose criteria associated with the intended 
land reuse. Where these doses are within the accept¬ 
able limit, regardless of the future uses of the site, then 
this enables a site to be released from any controls [61] . 
However, material originating from a released site 
needs to comply with the national requirements for 
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radiation protection for materials outside the scope of 
regulatory control [114]. 

The above aspects clearly show the importance of 
implementing a forward-looking planning process. 
While in the past, decommissioning and remediation 
activities have often been carried out as independent 
activities, optimization of effort, cost, impact, and risk 
can be achieved by carrying out decommissioning and 
remediation activities in an integrated manner as 
described in Falck et al. [112]. 

The high public concern regarding nuclear safety 
issues and the intention to build confidence to the 
remediation success has led to an acknowledgement 
of the need for stakeholder involvement. A system of 
the activities intended for providing transparency in 
remediation can be summarized as follows: 

1. Development of an effective public- 
communication project policy (openness, transpar¬ 
ency, truthfulness, effectiveness, timeliness etc.) 

2. Development of the stakeholder involvement goals 
(no surprises, confidence generating) 

3. Identification of relevant stakeholders to be 
involved into the decision conference 

4. Selection of communication, consultation, and 
participation methods for training of project staff 

5. Monitoring and review of success through local 
attitude survey; feedback from the stakeholders of 
different levels 

As noted earlier, there are many differing facilities 
with distinct remediation goals, required remedial 
actions and further planned use of the contaminated 
site. Decommissioning of the Wismut uranium mine in 
Germany, based mainly on overview of remediation 
activities within decommissioning is an example of 
successful remedial activities typical for the 
decommissioning process. 

Uranium mining in Saxony and Thuringia, known 
as Soviet-German Company (SDAG) Wismut, was in 
operation for more than 40 years. During that time, the 
Wismut company had mined around 231,000 t of ura¬ 
nium. Due to poor disposal practices in the first years 
of the operation, and routine natural radionuclides 
releases thereafter, vast areas mainly within the terri¬ 
tory of the facility were contaminated. In total, the area 
of the Wismut Company covered 36.9 km 2 , including 
24.4 km 2 of mine dumps, 7.2 km 2 of tailings ponds, and 


1.6 km 2 of open pit mine. The contamination of the site 
was represented by different radioactive materials, 
including: (1) scattered ore, (2) scattered tailings mate¬ 
rial (specific activity around 1-10 Bqg -1 for 226 Ra as 
the key nuclide, not in radioactive equilibrium), 
(3) lost uranium concentrate, (4) naturally leached 
uranium, for instance at sites from which waste 
dumps were removed [112]. 

In 1991, following integration of West and East 
Germany when the Federal Republic of Germany took 
responsibility for the SDAG Wismut Company, and the 
corresponding nuclear legacy, the Wismut Remedia¬ 
tion Project was started. The project involves remedial 
activities at many sites, including four underground 
mines and two processing plants. In 1991, for a reuse 
of the sites, buildings, and materials contaminated by 
uranium mining activities, the German Commission 
on Radiological Protection recommended a reference 
level of effective dose of 1 mSv a -1 . 

The extent of remedial measures which were under¬ 
taken at the site was justified based on site evaluation and 
remediation feasibility studies, rather than by applying 
uniform standards. Remediation of the Wismut legacy 
was viewed as an opportunity to return the affected land 
to productive use, thus enhancing the revitalization of 
the former mining areas in East Germany [112]. 

The Wismut site remediation took advantage of the 
open space available in unflooded mines and other sites 
to dispose of contaminated material in situ. The places 
used for disposal of contaminated material included: 

- Underground storage sites (“dry” mine galleries 
above the groundwater level) 

- Waste rock dumps (construction of cassettes, place¬ 
ment of the waste, coverage) 

- Beach areas of tailings ponds (same technology as 
for dumps) 

- Engineered hazardous waste disposal facility 
(constructed on top of a waste rock pile) 

Special attention was given to the operational waste 
that resulted from the application of modern remedia¬ 
tion technologies for the treatment of contaminated 
material. The contamination was concentrated 
in residues, reaching specific activities of up to 
1,000 Bqg“ 1238 U and 20 Bqg“ 1226 Ra. Special tech- 
niques for the immobilization of these residues needed 
to be applied. Despite the lack of the space available for 
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disposal, separation of waste according to type and 
level of contamination and further recycling of low 
contaminated or uncontaminated materials were the 
key elements of the waste management practice at the 
Wismut site. In particular, salvaged uncontaminated 
metallic scraps and scraps with a total surface alpha 
activity lower than 0.5 Bqcm -2 have been sold for 
smelting [112]. 

Former waste rock piles cannot always be completely 
remediated and they can be released for restricted reuse. 
In particular, such areas could be used for planting new 
forests, or in some cases even for fodder production or 
various industrial or trade applications. Forestry was an 
attractive cost-effective option which was considered 
sustainable in the long term. 

Considerable efforts were made over the past years 
for the benefit and protection of the population 
inhibiting areas affected by mining activities at the 
Wismut site. The significant reduction of adverse envi¬ 
ronmental impacts and the reutilization of these areas 
for industrial use, housing construction, or as a nature 
reserve have revitalized these regions. The “New Hori¬ 
zons through Remediation” program became a key 
element of Remediation. Integrated state-of-the-art 
technologies from around the world were applied to 
conduct decommissioning and remediation of the 
radioactive legacies left by the uranium mining in an 
efficient and cost-effective way leading to sustainable 
solutions. Remediation has had a benefit for regional 
development at two former mining towns, leading to 
rebirth of the health spa of Schlema (Fig. 10) and the 
development of the Federal Garden and Landscape 
Exhibition in 2007 (BUGA, 2007) to be hosted by the 
towns of Ronneburg and Gera [112]. 

Overall, the Wismut Project provides evidence of 
the potential success of a value-added approach with 
no high additional (or not reimbursable) costs, if reme¬ 
diation is efficiently undertaken with a well-defined 
reutilization in mind. Such successful utilization goals 
can only be developed in cooperation with the envis¬ 
aged future user (i.e., municipality or developer), the 
community and the regulatory authorities. If consen¬ 
sus is achieved with the stakeholders prior to remedial 
actions, then “remediation by objectives” becomes 
practicable, i.e., the objectives for the individual reme¬ 
dial steps can be determined, consistent with the ulti¬ 
mate utilization goal [112]. 
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Figure 10 

Former mining area near Schlema before (1991) and after 
(2009) remediation (http://www.wismut.de/) 


Decision-Aiding Tools 

The efficiency of countermeasures or remediation 
depends on many factors: the specific contamination 
scenario, the properties of the contaminants involved, 
characteristics of the affected environment, the percep¬ 
tion of countermeasures or remedial actions by the 
local population amongst many others. The effective¬ 
ness of such actions, cost, and resources required for 
their implementation vary considerably with these var¬ 
ious factors; this in turn affects the suitability of differ¬ 
ent countermeasures or remedial options for a given 
situation. Therefore, generalized recommendations 
which do not account for the diversity of local site- 
specific conditions can result in inadequate decision¬ 
making and may not be feasible to execute. 
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These multiple factors must be reconciled within 
a single optimization approach and there is 
a significant challenge in finding a balanced solution 
to the multidimensional problem each contamination 
scenario presents. This task can be facilitated with 
environmental decision support systems (EDSS), capable 
of providing advice on countermeasure/remediation 
strategies at different levels of the decision-making 
process, which take into account temporal and spatial 
variation in each factor. A conceptual framework and 
basic elements of such systems (EDSS) intended for 
support in remediation of contaminated areas is given 
in Fig. 11. 

Following the Chernobyl accident, various EDSS, 
or models which underpin them, have been produced, 
such as FORCON [115], SAVE [40], RESTORE 
[116, 117], and CESAR [118] which have enhanced 
the ability to optimize remediation strategies in con¬ 
taminated areas. 

Decision-aiding techniques are useful tools to 
determine the level and kind of protection which will 
provide the best solution given the various factors and 
constraints involved, taking into account inherent 
uncertainties and value judgments. This can only be 
achieved on the basis of a clear identification of 


Characteristics of contaminated area 

(Political, legal and administrative conditions, constrains in remediation) 



Decision making process 
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Figure 11 

Key elements of the environmental decision support 
system (EDSS) for remedial measures on contaminated 
areas (Adopted from Shershakov et al. [13]) 


available alternative management options and the fac¬ 
tors and constraints involved in the processes. 
Site-specific quantification is a difficult and time- 
consuming step in the decision-making process 
because all necessary data must be gathered or gener¬ 
ated by models to then use in combination with 
decision-aiding techniques to optimize the decision¬ 
making process [119]. Decision-aiding techniques have 
been widely used to select preferable remedial options 
for contaminated areas, starting in the early 1980 s 
when ICRP recommended the use of cost-benefit ana¬ 
lyses [34, 35]. In these analyses, the cost of environ¬ 
mental deterioration as a result of radiological 
contamination which could be averted using protective 
measures was compared with the cost of the protective 
measures [34, 35, 120, 121]. For decisions of greater 
complexity, including considerations of large-scale 
remedial, decision-making can be improved by 
complementing the above methods with multi-criteria 
analysis [13]. 

Cost-effectiveness and cost-benefit techniques are 
based on balancing the costs of management option 
implementation against the expected reduction of the 
collective dose, which is expressed in monetary value. 
The definition and use of the monetary value of collec¬ 
tive doses, introduced by ICRP in Publication 22 [ 120] , 
has been a key element of this approach; the imple¬ 
mentation of this approach has been, and currently is, 
the subject of many debates on ethical grounds. 

There are some clear differences between these 
two techniques. Cost-effectiveness analysis is intended 
mainly to eliminate low cost-effective management 
options and to rank the remaining options (see 
Fig. 12). The main elements of the approach are as 
follows: 

1. Characterization of management options in terms 
of their cost and the corresponding residual collec¬ 
tive doses following their implementation. 

2. Selection of cost-effective options for which there 
are no alternative options either to reach the same 
residual collective dose at a lower protection cost or 
for the same protection cost to reach a lower resid¬ 
ual collective dose. 

This process is illustrated graphically in Fig. 12. 
Each management option is represented by a dot and 
all those dots situated along the cost-effectiveness curve 
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Figure 12 

Cost-effectiveness analysis (Adopted from Lochard and 
Schneider [122]) 

represent cost-effective options. For example option 
(A) allows for an equivalent residual collective at 
a lower cost than (E), and option (C) gives a lower 
residual collective dose for the same cost as (F). All the 
options which are above on the curve are called “totally 
dominated” (worse) options and must be discarded 
from further consideration (Lochard and Schneider 
2001 [ 122 ]). 

Cost-effectiveness is intended to assess the “mar¬ 
ginal cost” of every management option, which has to 
be compared against the closest less or more expensive 
options. If a small additional cost would lead to a much 
higher effectiveness in terms of risk reduction, the 
option is not cost effective. Finally, each cost-effective 
option can be characterized by the increase in cost from 
one option to the next (DX) and the corresponding 
decrease in collective dose (DS). The quotient (DX/DS) 
is called the cost-effectiveness ratio and provides 
a basis for ranking the various options. However, deter¬ 
mination of the cost-effectiveness curve and the 
corresponding cost-effectiveness ratios does not pro¬ 
vide a basis for the selection of the optimum option. 
This is achieved by the introduction of a reference value 
for the cost-effectiveness ratio in the framework of the 
cost-benefit analysis [122]. 

The most widespread approach to performing 
the second technique, cost-benefit analyses is based 
on the determination, for each management option, 


Cost 
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Figure 13 

Cost-benefit analysis applied for the identification 
and justification of the optimal management option [34] 


of the sum of remediation costs and the cost of residual 
collective dose (expressed in monetary terms) resulting 
from that strategy, and then comparing them to iden¬ 
tify which option has the lowest total cost. This option 
can be accepted as the optimum solution, as illustrated 
in Fig. 13 [34]. 

There are many factors to be taken into account as 
part of the process of remediation justification and some 
of them cannot be quantified either in monetary terms 
or in terms of any resources, although they may be 
quantifiable in other units or ranking parameters. In 
such a case the most straightforward way to justify an 
optimized remediation strategy is the application of 
multi-attribute-utility analysis (MAUA). This technique 
was first recommended by ICRP for the optimization 
of radiation protection efforts in ICRP Publication 55 
[35]. The most recent ICRP Publications 103 and 104 
emphasize the great impact of social and political fac¬ 
tors on decisions regarding remediation which also 
encourage the use of MAUA as one of the basic 
approaches for remediation optimization [10, 31]. 

The basic principle of the multi-attribute-utility 
analysis technique is to construct a scoring scheme 
(or multi-attribute-utility function) for each option 
on the basis of all significant criteria which characterize 
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the situation (i.e., cost of remediation, collective or 
individual doses, perception of management options 
by the population, etc.). When the different remedial 
options are identified, the MAUA defines the relevant 
criteria for the decision process. The criteria are spec¬ 
ified in compliance with specific strategies to ensure 
remediation success. These criteria normally include: 
economic attributes, human health impact and public 
safety attributes, socio-psychological attributes, envi¬ 
ronmental secondary effects and ethical considerations, 
criteria based on public opinions, and the interests of 
various population groups. 

Every remediation alternative must be evaluated 
according to some or all of the above criteria, either 
quantitatively or qualitatively. Because of their diver¬ 
sity, possible options will be ranked differently for each 
criterion. In further steps, weighting-coefficients are 
assigned to each criterion to account for their relative 
importance. This is the most important and often 
difficult step in MAUA. Nevertheless, several tech¬ 
niques are available to derive this set of values, and 
whatever method is used, the choice of weighting coef¬ 
ficients should be justified. Formally, each option is 
qualified by its total utility ( U) calculated as follows: 

Ui = ^ ^ kj • Ujj, 

Where i is the remediation alternative number; j is 
the criterion number; kj is the weighting coefficient 
showing the relative importance of each option (nor¬ 
malized Yj kj- 1); Ui is the utility of measure i. 

The single utility associated with each criterion can 
be defined either as a linear function of the value 
expressing the criteria or as a nonlinear function to 
incorporate the preferences of the decision makers into 
the analysis. For example, it is possible to define utility 
functions which incorporate risk aversion according to 
the level of individual or collective exposures [35]. 

Finally, the management options which lead to the 
highest total utility should be selected. Since most of 
the weighting factors generally rely on decision makers’ 
judgments, it is highly recommended that a sensitivity 
analysis be performed using different sets of weighting 
coefficients to test the “robustness” of the results. 

Based on such an approach, a few specific decision- 
support systems for remediation of different 


environments: aquatic, forest, and rural were suggested 
and used for the long-term optimization of remediation 
strategies following the Chernobyl accident [85, 
123-127]. 

Future Directions 

Different nuclear applications have contributed to the 
rapid development of science and technology within 
the second half of the twentieth century. The use of 
numerous nuclear activities has, in many countries 
around the world, led to the contamination of signifi¬ 
cant parts of some ecosystems, which require remedi¬ 
ation. Due to the variety of sources and the 
characteristics of the receiving environment, the sce¬ 
nario of contaminated sites varies greatly. Nuclear 
weapon testing and severe radiation accidents, in par¬ 
ticular, have also greatly contributed to global concern 
about the need for reliable and comprehensive radia¬ 
tion risk assessment. In response, the development of 
countermeasures and remediation constitute two 
major radiation protection tools that can alleviate the 
radiological consequences of emergencies and reducing 
the radiation exposure from existing contamination, 
respectively. 

Currently, the quality of assessments of the conse¬ 
quences of the contamination of the environment 
varies depending on the amount of available data on 
the environmental behavior of different radionuclides 
in the environment. While the data available on some 
key dose forming radionuclides, especially those 
released on the major accidents, are reasonably good, 
that of many other radionuclides is rather patchy. 
Therefore, there are some notable deficiencies in infor¬ 
mation on environmental behavior of many important 
radionuclides [128] and also in the associated radiation 
effects on many biological and ecological end points 
particularly in plants and animals. In some cases, 
contaminated sites already released for general use 
have had to be reclassified because of an insufficient 
understanding or quantification of the radiation risk at 
the time when they were released. The need for 
remediation of such sites was not fully appreciated as 
soon as it might have been if better information had 
been available on the environmental pathways of 
exposure. 
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Appropriate response to contamination of the envi¬ 
ronment with radionuclides requires a comprehensive 
multidisciplinary approach as outlined in this chapter. 
The paper is focused on actions to prevent exposure of 
humans but the challenge is greater than this in that the 
impact of countermeasures or remediation on wildlife 
is difficult to quantify on the basis of current knowl¬ 
edge. Therefore, there is a significant future challenge 
to consider the need for, and potential implementation 
of, a holistic framework for the response to environ¬ 
mental contamination considering both the protection 
of human beings and the environment itself. 
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As world population increases, greater demand is 
placed on finite resources that earth can provide. 
Geographic areas once judged inadequate or inap¬ 
propriate for construction, water and wastewater 
management, or transportation routes, among other 
activities, are being considered for and being used for 
development. 

Environmental geology is the discipline which brings 
together these issues in a holistic approach to land use 
planning and utilization. This section of the encyclopedia 
examines some of the problems, techniques, and solu¬ 
tions for managing resources so that a balance can be 
maintained between development and the environment. 

Natural disasters are reported frequently and have 
varying impacts depending upon the closeness of 
the disaster to population centers and the magnitude 
of the hazard. In all cases, the primary purpose of study¬ 
ing these phenomena is to minimize loss of human life 
and secondarily to minimize loss of infrastructure and 
impacts on present and future developments. 

Each type of hazard impacts the environment in 
different ways, with some posing more danger to 
human life than others, and some posing more 
hazards to infrastructure and ongoing development. 
This section on environmental geology will address 
these various factors to provide a better understanding 
of how they occur, what can be done to minimize risk and 
how to plan better to address these concerns in the future. 

Impacts of natural disasters including earthquakes 
and tsunamis and occurrence of sinkholes and land¬ 
slides can have more devastating effects due to increases 
in world population. Two entries address earthquake 
issues: ► Earthquake Faulting, Ground Motions and 
Deformations and ► Construction Planning, Environ¬ 
mental Impact of Foundation Studies and Earthquake 
Issues. Earthquakes have occurred throughout history 
and left their traces as manifestations of geological 
processes on the geological record. 

Many lessons have been learned from damages 
caused by earthquakes. Guidelines have been developed 


for design of earthquake resistant structures under 
different ground conditions. Education of the public 
has saved lives and continues to be a critical need as 
is research to develop mechanisms to forewarn the 
public. 

Sinkholes and landslides are both naturally 
occurring and man induced. Both hazards can cause 
significant damages to various structures; however, 
they may not pose as significant a risk to human 
life as earthquakes. Protective measures can be 
performed to minimize the risks involved including 
preparing sinkhole maps which delineate the low, 
medium, and high risk areas subject to subsidence. 
If decisions are made to develop in sinkhole and 
landslide prone terrains, investigations of the geology 
and hydrogeology can determine where to seat struc¬ 
tures into bedrock so as to minimize damages. In 
addition, engineering design can address the added 
stresses hazards pose and incorporate additional 
measures so that buildings and infrastructure can 
withstand them. 

► Karst Terrane and Transportation Issues dis¬ 
cusses various types of sinkholes, what their causes 
are and how to protect against their occurrence. This 
entry addresses ways to remediate damage and prevent 
future reoccurrence. ► Land Subsidence in Urban 
Environment reviews an emerging cause of subsidence 
in the burgeoning population centers of the world. 
Better management of water resources is required to 
prevent further subsidence from occurring in urban 
areas. Optimal utilization of water supplies must be 
implemented to correct existing situations and to min¬ 
imize subsidence in the future. 

Overuse of water supplies in concert with drought 
conditions and land degradation is creating desertifi¬ 
cation conditions rapidly in the world today. Deserti¬ 
fication affects arid, semiarid, and dry subhumid 
regions; occurring on every continent, it impacts 
more than two billion people in these areas. Often 
these conditions occur across country borders and 
within areas of the world where tensions are already 
strained for political and religious reasons. ► Deserti¬ 
fication and Impact on Human Systems discusses the 
activities of the United Nations Convention to Combat 
Desertification (UNCCD) and emphasizes the need for 
sustainable development at the local level. Stakeholders 
in these inherently fragile ecosystems must be brought 
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together to arrive at solutions that will allow continued 
production of ecosystem goods and services. 

Not only do arid conditions require sustainable 
development, but urban areas do also. In The Interre¬ 
lated Roles of Natural Resources and Sustainability in 
Urban Planning, cities are increasingly recognized as 
the most important global sustainability challenge of 
this century. Here too, stakeholders are critical in 
regard to resource access and distribution. Studies of 
urban metabolism are required to explore interactions 
among resource flows, urban transformation processes, 
waste streams, and quality of life. Results of these 
studies must be correlated to address the challenge of 
combining urban planning with sustainable resource 
management. 

An integral part of continued growth and develop¬ 
ment of urban areas is detailed in ► Groundwater 
Salinity Due to Urban Growth. Understanding of 
groundwater is critical in regions where there is an 
increase in water demand and where shortages in 
water supply affect available and proposed infrastruc¬ 
ture. Knowledge of water accessibility in terms of quan¬ 
tity and quality can help reduce the risk of saline 
intrusion in coastal areas from urban growth. Socio¬ 
economic and environmental development programs 
require adequate political decision making and plan¬ 
ning at local and regional levels. Proper space planning 
implies that proposed land use is in agreement with 
what land “attitude” and its environment can provide 
without putting in danger sustainability. 

Geochemical modeling is applicable to investigations 
of water quality, as well as to many other fields. Modeling 
is a powerful and indispensable tool for research and 
investigations of environmental sustainability, science, 
and technology. It provides quantitative evaluation of 
complex processes and it can predict the extent and 
consequences of geochemical reactions in the order of 
tens of thousands of years. Basic concepts, information 
for accessing modeling codes and further readings are 
included in this chapter. More importantly, applications 
in the field are provided. An excellent example of the 
utility of geochemical modeling in environmental sus¬ 
tainability, science, and engineering is illustrated in geo¬ 
logical carbon sequestration. 

Much research is currently being performed on 
injection of carbon dioxide (C0 2 ) into deep geological 
formations as a climate mitigation tool. ► Geologic 


Carbon Sequestration: Sustainability and Environmen¬ 
tal Risk provides an overview of the research and ideas 
about the future. National and local regulations will 
require risk assessments of wellbore integrity, well 
injectivity, and long-term performance in the order of 
thousands of years. The geology of each site differs and 
performance assessments are necessary at all stages of 
C0 2 storage operations: site assessment and selection, 
design, installation, operations and monitoring, and 
closure and post closure. 

Drilling into deep geological formations is also 
taking place offshore in an activity known as Oceanic 
Hydrocarbon Investigation (OHI). OHI is currently 
increasing its production volume in an energy hungry 
society and the quest for finding and exploiting under¬ 
ground hydrocarbon resources is being continuously 
improved. 

This is particularly true in relation to underwater 
mapping, inspection, and monitoring of the environ¬ 
ment. ► Marine Life Associated with Offshore Drilling, 
Pipelines, and Platforms states that OHI has inflicted 
oil-spills on marine and coastal environments. At large, 
however, OHI appears to be environmentally friendly. 
This notion has been documented by extensive seafloor 
mapping and annual visual inspections of platforms, 
pipelines, and other infrastructure. 

Environmental concerns in marine environments 
are also being addressed as they relate to dredging of 
navigable waterways. Dredges of various designs have 
been used for many years to create and maintain nav¬ 
igable waterways to move people, goods, and materials. 
► Dredging Practices and Environmental Consider¬ 
ations discusses the history of dredging from the 
times of the pyramids when people used long-handled 
dipper shovels, to when productivity gains from 
using animals increased digging power. In the 1800s 
development of electric and steam power units enabled 
construction of mechanical, backhoe, and pipeline 
dredges. Hydraulic technology in the 1960s, utilizing 
hydraulic winches and hydraulic rotary cutter drives, 
facilitated removal of finer grade sediment. As technol¬ 
ogy improved, so too did concern for the environment, 
such that today there is much debate and research 
on the impacts of dredging. This concern has height¬ 
ened with contaminated sediments in the picture 
and the question of whether to dredge or leave them 
in place. 
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Navigable waterways integrally relate to dam con¬ 
struction and movement of people, goods, and services. 
Construction of dams has a much greater impact on 
the environment. Since time immemorial, people have 
looked for ways to tame surface waters to prevent 
floods and use water for different purposes, including 
transportation, irrigation, water supply, and electricity 
production. ► Dam Engineering and its Environmen¬ 
tal Aspects details the history of dams and their pri¬ 
mary role to store or to divert waters. Many dam 
projects have been controversial and potentially disas¬ 
trous. Failure of dams and their environmental impacts 
are the main reasons for controversy and fear. This is 
particularly true in karst environments which tend to 
be more sensitive to the influence of dams and reser¬ 
voirs. With increasing demand on water resources and 
electric power, an important issue is how to maintain 
the balance between the necessity for development and 
preservation of the environment. 

Energy continues to weave a thread through this 
section of the encyclopedia as the chapter on mining 
deals with mining of coal. Also addressed are impacts 
from mining of rare earth minerals, gold and copper 
among others. ► Mining and Its Environmental 
Impacts details the demands of the growing world 
population for increasing amounts of raw materials. 
To find acceptance and support in society, however, any 
future mining activity will demand state of the art 
environmental management and must incorporate sus¬ 
tainable development. A history of mining is provided, 
as well as an overview of the environmental effects of 
mining, differentiated by its relevant phases: explora¬ 
tion, exploitation and processing, decommissioning 
and rehabilitation. 

Nuclear energy has its own inherent concerns 
which are increased in times of natural disasters. 
► Groundwater Impacts of Radioactive Wastes and 
Associated Environmental Modeling Assessment 
discusses the long-term risks to human health and 
environment from radioactive waste contamination in 
soil and groundwater. Major sources of radioactive 
wastes and their impact on groundwater contamina¬ 
tion are reviewed along with the major biogeochemical 
processes that control the transport and fate of 
radionuclide contaminants. The evolution of mathe¬ 
matical models designed to simulate and assess the 
transport and transformation of radionuclides are 


also described. Understanding the fate and transport 
of radionuclide contaminants can improve the ability 
to forecast contaminant destination and select cost 
effective remediation technologies. 

Perhaps the greatest energy source known to man is 
from volcanoes. ► Volcanoes of Mexico includes an 
introduction to volcanoes: the reasons they form, 
where they are located, and the hazards presented by 
different types of eruption. Mexico is one of the world’s 
most volcanic regions and a summary of Mexican vol¬ 
canoes is presented. Volcanic eruptions can be divided 
into two broad categories: explosive or effusive. Various 
factors combine to determine how the magma emerges, 
the key ingredient of which is water. With the exception 
of ballistics, which do not reach very far from the 
volcano, volcanic hazards can present a major risk to 
populations living close by. Volcanologists need to 
investigate deposits of previous eruptions to under¬ 
stand what might happen in the future. Hazard maps 
can be created which indicate where eruptive products 
might accumulate. 

Water, energy, minerals, and the environment all play 
a role in development and future growth of the world 
and its population. No single aspect can be addressed 
without including another. This section of the encyclo¬ 
pedia on environmental geology places these different 
factors into perspective so that scientists, politicians, 
economists, planners, and stakeholders can work 
together to develop solutions that provide an acceptable 
balance to development and the environment. 


1 Environmental Impacts of an Open 
Ocean Mariculture Operation in 
Kona, Hawaii 

Neil Anthony Sims 

Kona Blue Water Farms, Inc, Kailua-Kona, HI, USA 

Article Outline 

Glossary 

Definition of the Subject 
Introduction 

Overview of the Kona Blue Water Farms Operation 


E 





3556 


E 


Environmental Impacts of an Open Ocean Mariculture Operation in Kona, Hawaii 


Farm Operations 

Localized Environmental Concerns and Actual Impacts 
Global Impacts: Fish Meal and Fish Oil Usage 
Future Directions 
Bibliography 

Glossary 

Sustainability The use of resources in such a manner 
as not to impinge upon the ability of future gener¬ 
ations to enjoy similar use. 

Open-ocean mariculture The culture of marine fish, 
invertebrates, or algae in exposed ocean locations. 

Definition of the Subject 

Open-ocean mariculture is the culture of marine fish, 
invertebrates, or algae in exposed ocean locations. The 
criteria for demarcation of open-ocean sites from 
inshore or nearshore aquaculture sites is often the 
subject of debate, with various schemes using some 
integration of distance from shore, depth of water, or 
exposure to open water and high sea conditions. 
A more meaningful definition is perhaps, simply, that 
an open-ocean mariculture site is a site where any of 
these conditions or criteria combine to disconnect the 
culture system from the surrounding substrates that 
might otherwise be subject to some significant envi¬ 
ronmental impact, or might become reservoirs for 
some ecological feedback with the culture system. 
Alternatively, rather than using a geographical or oper¬ 
ational distinction, open-ocean mariculture might 
simply be described as culture of marine animals or 
plants at sites that are further offshore, or in deeper 
water. This definition reflects the evolving, aspira- 
tional, and incremental improvement in culture sys¬ 
tems, rather than thinking of the open ocean as a final 
destination. 

Whatever the definition, open-ocean mariculture 
offers the potential for a scalable production system 
for valuable, healthful seafood, with negligible environ¬ 
mental impacts. 

Introduction 

The Seafood Crisis 

The oceans are in deep trouble. Even though the 
demand for seafood has increased, capture fisheries 


around the world are collapsing from overfishing, or 
are static. Wild stock fisheries cannot sustain any 
greater pressures, and clearly cannot scale to meet the 
growing needs of increasing population size, increasing 
affluence, and wider recognition of the health benefits 
of seafood. 

The USA is both symptomatic of the seafood crisis 
and significantly contributive to the problem. In the 
USA, closures or buyback schemes to reduce effort have 
effectively shut down once-productive fisheries for 
Atlantic tunas and swordfish, the groundfish of 
Georges Bank and other Northeast fisheries, Pacific 
Coast sardines, albacore, and more recently, rockfish. 
Other environmental concerns for endangered species 
or marine mammals have seen closures or limitations 
placed on fisheries for shrimp in the Gulf of Mexico, 
purse seining for tuna in the Pacific, and long-lining 
for tuna and swordfish in Hawaii and the US 
Pacific. Currently, over 80% of the seafood consumed 
in the USA is imported, and more than half of those 
imports are from farmed sources. 

Hawaii Fisheries Status The Hawaiian Islands 
represent a microcosm of the global fishing crisis. 
Increased fishing power - in terms of both the number 
of boats and available technologies - has seen the 
valuable deep bottom fish catches decline precipitously 
over the last few decades. GPS units allow deep bottom 
fishermen to return repeatedly and precisely to a “hot 
spot” until the fish are gone. Electric reels reduce the 
labor of hauling up from 60 fathoms to simply flicking 
a switch. Powerful echo sounders allow tuna fishermen 
to identify the location of individual fish tens of 
fathoms beneath their boat. At the same time, catches 
of sashimi-grade tuna have plummeted due to compet¬ 
ing gear-types (seiners, long-liners, pole-and-line) and 
conflicting national interests throughout the range of 
these highly migratory species. Some 20 years ago, over 
25 commercial fishing boats worked out of the local 
Honokohau Harbor in Kona, Hawaii, targeting deep¬ 
water snappers and big-eye tuna in the “ika-shibi” 
fishery. Today, there is only one commercial fishing 
vessel based there, and even that is only part-time. 

The decreasing catch volume and decreasing aver¬ 
age size of fish caught in the bottom fish fishery in 
Hawaii is causing increasing concern. By 1996, only 
20% of the onaga (Etelis coruscans) catch in the main 
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Hawaiian Islands (MHI) had previously spawned; sim¬ 
ilar declines were evident among other species stocks. 
These species’ biological characteristics make them 
vulnerable to recruitment overfishing; NOAA Fisheries 
staff estimate that an onaga attains maturity at about 
4.1 years of age, at a size of 66 cm. With Federal and 
State data indicating significant overfishing of these 
stocks, increased regulation became imperative. In 
June 1998, new legislation went into effect, establishing 
limits on fishing gear, bag limits, registration of bottom 
fishing craft, and restricted fishing areas (up to 20% of 
the bottom fish ground was placed off limits) for the 
commercial and recreational fishery industry. 

Further restrictions are still needed. Over the last 
two summers, the State declared all bottom fishing 
closed for the summer period from May 1st to Septem¬ 
ber 30th, throughout the island range. State authorities 
have indicated that this seasonal closure may need to be 
repeated for coming years, as well, before there is any 
measurable improvement in stocks. The majority of 
high-value species consumed in Hawaii already are 
imported from other areas, such as the South Pacific 
and South East Asia. 

Aquaculture as Part of the Solution Aquaculture 
offers the only viable solution to the growing demand 
for sustainable, healthy sources of seafood protein for 
human consumption. Fish farming reduces exploit¬ 
ative pressure on already depleted wild stocks, supports 
the growth of coastal and rural industries, and yields 
a product that is low in saturated fat and high in 
protein. The annual contribution of aquaculture to 
global aquatic production is now almost equal to that 
of wild catch (47% versus 53% [1]). In 1985, aquacul¬ 
ture represented only 5% of US fish consumption, yet 
today that figure stands at around 40%. Aquaculture 
growth is rapid, and is projected to increase in pace. 
The US Department of Commerce has set a goal of 
a fivefold increase in domestic aquaculture production 
value, to $5 billion, by 2025. 

Domestic aquaculture production using existing 
methods or species cannot keep pace. Almost all US 
production is from freshwater species; the only marine 
species cultured in any quantity are salmon and striped 
bass, both of which are anadromous (freshwater 
spawning). It is almost impossible to obtain permits 
for nearshore farm sites; there is intense competition 


among different user groups for nearshore waters, 
and aquaculture has been besmirched in the conven¬ 
tional wisdom as environmentally destructive and 
unsustainable. 

The Open-Ocean Potential Open-ocean mariculture 
technology has recently moved from the realm of sci¬ 
ence fiction [2, 3] to commercial reality. In the last few 
years, there have been dramatic advances in the legal 
and engineering fields, which have opened up the new 
fish farming frontier of offshore areas. New submers¬ 
ible net pen systems have been pioneered by 
OceanSpar, LLC, of Washington State (“Sea Station™” 
net pens), and Ocean Farm Technologies, Inc., of 
Maine (“Aquapod™” net pens). These new technolo¬ 
gies have dramatically increased the workable extent of 
ocean farming, by providing seaworthy platforms for 
grow-out of fish in exposed offshore environments. 
These farms are therefore able to be located further 
offshore, in sites where currents, prevailing sea condi¬ 
tions or seasonal storm events may make surface 
pens inadvisable. These operations are also usually 
portrayed as having less environmental impact than 
nearshore fish farms. The rationale is that there is 
a cleaner and clearer disconnect between the farm and 
the underlying benthic substrate or the adjacent shore¬ 
line so that there is both minimal environmental 
impact and negligible potential for negative accumula¬ 
tive feedback from the environment to the culture 
system. 

But how have these first open-ocean mariculture 
operations actually performed? Is there any basis in fact 
to these claims of environmentally sound mariculture 
in offshore locations? What are the opportunities for 
expansion of this industry, and what are the risks and 
precautions - both environmental, biological and 
economic - that should accompany such scaling? 

This entry reviews some of the recent progress 
toward realizing this potential, as exemplified by one 
farm operation, in Kona, Hawaii. The environmental 
performance over the past 5 years of operation at Kona 
Blue Water Farms’ site is instructive. It may demon¬ 
strate how open-ocean mariculture may represent 
a partial solution to the seafood crisis: a scalable, envi¬ 
ronmentally sound production system for high-value 
seafood. The environmental impacts of this one open- 
ocean mariculture site are reviewed here, in the context 
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of both local ecosystem effects and global marine 
resource use efficiency: the “footprint” on the oceans. 

Overview of the Kona Blue Water Farms 
Operation 

Prior to 1998, Hawaii’s ocean-leasing legislation lim¬ 
ited any potential project to a maximum of 4 acres, 
which had to be used for either educational or research 
purposes, and not for commercial gain. Through sev¬ 
eral years of work by industry aspirants, and strong 
leadership by the State Aquaculture Development Pro¬ 
gram, legislation was passed that allowed commercial 
offshore fish farms or energy projects. 

The Kona Blue Water Farms principals had been 
involved through the legislative review, and with the 
passage of the bill, began research into developing 
hatchery culture techniques for high-value marine 
fish, simultaneously surveying the Kona Coastline for 
prospective offshore farm sites. After an extensive 
3-year process of consultation and consensus-building 
with the community, Kona Blue was granted the req¬ 
uisite State and Federal permits for the original off¬ 
shore farm site in March, 2004. The operation began 
deployment in February 2005 and first fish were 
harvested offshore in September, 2005. Since then, pro¬ 
duction has grown to the extent that Kona Blue has 
been harvesting up to 25,000 lb of sashimi-grade Kona 
Kampachi® per week, and up to 500 T per year. Kona 
Kampachi® (Seriola rivoliana) is also known as kahala, 
Long-fin Amberjack or Almaco Jack. It is related to the 
Japanese hamachi (S. quinqueradiata ), but is native to 
Hawaii, and is distributed throughout the warm waters 
of the world. 

The Offshore Farm Site and Farm Operations 

The Lease Area Site selection is a critical component 
for any mariculture operation, but is particularly so for 
an innovative offshore farm that is pioneering both 
a new permitting process and a new net pen system. 
The original farm lease site was selected on the basis of 
the following criteria: 

1. The selected site was in a deep-water area, over 
200 ft deep, with brisk currents. 

2. There was little or no public use of this area. The 
farm site lay between the limits of normal 


recreational scuba diving (around 120 ft) and the 
normal depths for offshore trolling for ono (wahoo, 
Acanthocybium solandri). 

3. The site afforded some protection from both Kona 
storms and the strong trade winds (Figs. 1 and 2). 
The proximity to shore also allows for future telem¬ 
etry links to shore for farm control and security. 

4. There was ready access from Honokohau Harbor, 
5 miles to the south, which provides support 
facilities such as slips, fueling, and land for staging 
of equipment and feed. 

5. The site was directly offshore from the Kona Inter¬ 
national Airport and NELHA (Fig. 2), and, as such, 
its use was consistent with the adjacent land uses, 
and it represented no significant impact on the 
viewplane. 

The farm site’s topography and oceanography are 
distinguished by the depth of water, the bare sand 
substrate, the strong currents through the area, the 
exposure to high winter surf and strong trade winds, 
and the adjacent shoreline of a narrow coral bench reef 
with a steep basalt (lava) cliff. A few black sand beaches 
also lie along the coastline, to the north of the site, but 
these are little used, except by recreational fishermen. 
The preexisting uses of the farm lease area itself were 
negligible, because of its depth, the paucity of fish, and 
the barren benthos. 
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Environmental Impacts of an Open Ocean Mariculture 
Operation in Kona, Hawaii. Figure 1 

Location of the Kona Coast on the western side of the Big 
Island of Hawaii affords moderate protection from trade 
winds and winter swells 
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Environmental Impacts of an Open Ocean Mariculture 
Operation in Kona, Hawaii. Figure 2 

Kona blue water farms' site located in waters over 200 ft 
deep over a sand bottom, a 1/2-mile offshore from 
a pristine coral fringing reef abutting lava cliffs. The site is 
a mile north of Keahole Point and west of the Kona 
International Airport 


The net pens are all concentrated toward the center 
of the lease area (see Figs. 2 and 3), within two mooring 
arrays: one containing six net pens, and the other 
containing two net pens and the feed barge. The closest 
distance from the edge of the central grid array to shore 
is approximately 2,600 ft, or almost half a mile to the 
northeast, to Unualoha Point. 

The farm site lease provides “negotiated exclusiv¬ 
ity”: Transit, trolling, hoop-net fishing and hook-and- 
line fishing are permitted throughout the lease area, but 
for liability, insurance, and safety reasons, there is no 
authorized anchoring, scuba diving, or swimming 
permitted. 


The 90 acre lease area initially accommodated eight 
submersible Sea Station net pens, each of around 
3,000 m 3 capacity (Fig. 4). The outermost area of the 
lease is used almost solely for mooring lines, which 
require a 5:1 scope. The net pens were originally tied 
into submerged grids that were anchored into the soft 
substrate using steel embedment anchors and chains. 
A series of buoys and weights ensure that the anchor 
lines are perpetually taut, to eliminate any risk of 
entanglement by marine mammals. Bridles from the 
mooring grid corners attach to the net pen rims, to 
hold the net pens in place in each grid square. 

Farm Operations 

The daily activities on the farm primarily consist of 
feeding the fish in the pens. Underwater video cameras 
inside the net pens are used to relay visual images to the 
operators on the feed barge. This enables the feed 
operators to regulate feed to ensure that no feed is 
wasted, and that excess feed does not fall below the 
net pen. Any fish carcasses are regularly removed by 
divers. Carcasses are disposed of as solid wastes in the 
county landfill. 

Harvests usually occur twice each week. Fish are 
harvested into an ice-brine slurry, to quickly and 
humanely kill the animals with a minimum of damage. 
Fish are all transported whole, in ice-brine, to a single 
land-based processing facility, for packing and ship¬ 
ping. No fish processing occurs at sea during the har¬ 
vests. Disposal of processing wastes is the responsibility 
of the wholesalers or other purchasers of the fish, but, 
at present, most trimmings from fillets go into the 
landfill. 

Support activities for the existing operation are 
based out of Honokohau Harbor, where a half acre of 
land rented from the State accommodates containers 
for feed storage, gear storage areas, a closed workshop 
area, restroom, and office. 

The farm is also serviced by a semipermanent feed 
barge/security platform vessel, which has been 
deployed on-site since October, 2007. A separate har¬ 
vest boat - the 74 ft F.V. Kona Kampachi - transports 
the harvested product back from the farm site to the 
harbor. Several other smaller work boats are also used 
to support net pen and grid maintenance and cleaning, 
and other tasks. 
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Environmental Impacts of an Open Ocean Mariculture Operation in Kona, Hawaii. Figure 3 

Modified mooring array and grid dimensions - plan view. The number of net pens is presently being reduced from current 
eight smaller pens to five larger pens, with the same overall culture capacity (24,000 m 3 ). The submerged central grid 
remains at around 30 ft (9 m) beneath the surface 


Localized Environmental Concerns and Actual 
Impacts 

The Presumed Problems 

Aquaculture - or indeed, development of any food 
production system - brings with it attendant environ¬ 
mental concerns. Fish farms are widely accused of envi¬ 
ronmental degradation. The concerns that are often 
voiced include the following local-affect possibilities: 

• Detrimental impacts on water quality 

• Nutrient enrichment of the substrate beneath the farm 

• Antifoulant paints from net pens to contaminate 
the substrate 

• Therapeutant or antibiotic misuse to harm the 
surrounding biota 


• Escapes to outcompete wild fish for spawning 
grounds or feed 

• Escapes to dilute the wild fish gene pool 

• Proliferation of pests, parasites, and diseases inside 
the net pens, which can then be transferred to wild 
fish 

• Entanglement of whales, dolphins, and other 
marine mammals 

• Disruption of marine mammal or other species’ 
migratory paths 

• Harmful deterrents or fatal control measures 
against predators 

• Excessive use of fish meal and fish oil, leading to 
overharvesting of the smaller pelagic species 
targeted by industrial reduction fisheries 
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Environmental Impacts of an Open Ocean Mariculture Operation in Kona, Hawaii. Figure 4 

Submersible Sea Station® Net Pens deployed on Kona Blue farm site, (a) Design of submersible SS6200 Sea Station net 
pens, with central steel spar and steel rim. (b) Sea Station SS3000 raised to rim-level on the Kona Blue farm site. 

(c) A submerged Sea Station SS3000 and diver on Kona Blue site 


• Exclusion of other user groups from traditional, 
cultural, or recreational uses of the farm area 

• Visual impact of the net pens on the viewplane 

With almost 5 years’ experience at the Kona Blue 
farm site, then, it is appropriate to evaluate the actual 
data and observations recorded at the Kona operation, 


and to compare this experience with the concerns that 
had been, and continue to be, voiced. Each of these 
issues is examined in detail, below, beginning with an 
evaluation of the de novo environmental status of the 
Kona Blue farm site, and then detailing the impacts that 
have occurred, their context, and their significance. 
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The Actual Observed Impacts: Locally 

Water Quality and Effluent Impacts The water 
quality at the farm site is close to oceanic, with strong 
currents and low turbidity. Underwater visibility 
usually exceeds 100 ft or more. 

General water movement patterns at the farm site 
are governed by the longshore currents past Keahole 
Point (the western-most point of the Big Island of 
Hawaii), 1 mile to the South. An S4 current meter 
deployed at the farm site over several periods since 
2004 showed regular peak current speeds of over 
50 cm/s (about 1 kt, at a depth of around 40 ft). Current 
headings were longshore: generally to the North (pre¬ 
dominantly), but also to the South. The two points of 
first impact downstream from the farm site are there¬ 
fore either Keahole Point, around 1 mile to the south of 
the site, or the Mahai’ula-Makalawena shelf area, 
around 3 miles to the north. 

Because of the community concerns about poten¬ 
tial impacts from the farm operation on water quality, 
the company had made commitments during the orig¬ 
inal permit process to a policy of ongoing transparency 
and objectivity in monitoring. These commitments 
included: 

• Use of objective, third party experts to collect the 
water quality samples. 

• Use of local water quality laboratories - such as 
NELHA Water Quality Lab, or local private labora¬ 
tories - for conducting the sample analysis. 

• Placement of copies of all monthly water quality 
monitoring reports at local repositories, such as the 
State Aquatic Resources office at Honokohau, or 
the NELHA library, so that local residents can 
review this data. 

• Provision of reasonable access to Federal, State, and 
County officials for monitoring and oversight 
purposes. 

Monthly measurements have been taken of ammo¬ 
nia and turbidity (the two most relevant water quality 
parameters for fish farming) at three depths (surface, 
mid-water - 50 ft deep, level with the submerged net 
pens, and at the bottom) and at a total of seven stations 
(two control stations upcurrent, one effluent station 
immediately downcurrent of the net pen with the 
greatest biomass, and four “zone of mixing” stations 


4,000 ft downcurrent: Fig. 5). Quarterly measurements 
are also taken for a range of other parameters. 

Detailed water quality data are available on the 
company’s web site [4]. Figure 6 shows the mean for 
each sample site for turbidity for September, 2008, 
when the farm was at peak production of around 500 
T annually. Turbidity is probably the best metric for fish 
feces and other particulates in the water, and so is most 
likely to reflect any impact from the farm’s presence. 
These data are definitively clear - there is no discernible 
difference between water quality parameters at the 
upcurrent control sites, and the effluent site (1 m 
downcurrent of the net pen with the highest biomass) 
or the “zone of mixing” sites downcurrent. These 
results confirm that there is no measureable impact on 
water quality from the existing farm operations. 

Benthic Impacts The substrate beneath the farm is 
over 200 ft deep, and almost exclusively comprises bare, 
coarse sand. Located along the shoreline, some 2,000 ft 
to the East (directly across the longshore currents) is 
a diverse coral reef community. 

Impacts on Substrate Beneath and Around the Farm 
Site Prior to farm installation, a preliminary survey 
of the site was undertaken by repeated bounce dives, 
using scuba, to depths of 220 ft. Because the depth of 
the farm site is beyond the limits of normal safe diving, 
and the strength and unpredictability of the currents 
precluded ready use of grab samples or drop video 
cameras, the original permit provided that no benthic 
monitoring would be required. Over time, however, 
permit requirements were tightened to include grab 
sample monitoring of substrate chemistry and 
infaunal micromollusk community structure, and 
video monitoring using drop cameras. These samples 
have been conducted quarterly, and reports and video 
footage have been posted on Kona Blue’s website [4]. 

These results generally indicate that there has been 
no measureable impact on the benthic community 
around the farm site. There have been episodic 
perturbances of substrate chemistry immediately 
underneath the cage footprint, with a few instances of 
anoxic conditions during 2007, during periods when 
a new feed distribution system was being tested. This 
resulted in some pulverization of pellets, and reduction 
of feed to a “slurry,” rather than discrete pellets. Once 
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Environmental Impacts of an Open Ocean Mariculture Operation in Kona, Hawaii. Figure 5 

Water quality sampling station map. Aerial photograph of the Kona Blue site showing water quality monitoring 
sampling station locations. Sampling stations under a prevailing N-setting current are shown. Under a S-setting current, 
control and compliance stations will be reversed. ZOM = Zone of Mixing 


the feed system was refined, the substrate returned to its 
more normal condition, there was no further signifi¬ 
cant nutrient enrichment of the substrate. 

Filamentous algae have also been visible in the 
drop-camera videos from around the farm site. These 
appear to have been detached from the cage mesh or 
the mooring lines, as the algae are not attached to the 
coarse sand substrate. Presumably, these algae are 
dispersed during periods of high current. 

Monitoring of infaunal micromollusc assem¬ 
blages in the substrate samples has also demon¬ 
strated that there has been no significant change in 


the community structure resulting from the farm 
presence (see reports on-line: [4]). 

Impacts on the Adjacent Coral Reef Community A 
comprehensive survey of marine biota was conducted 
on the reef directly adjacent to the existing farm lease 
area, just south of Unualoha Point [5]. The survey of 
the benthic biota of the fringing reef crest used 
protocols identical to those employed by the State s 
Division of Aquatic Resources, in their West Hawaii 
Reef Management Task Force Survey. This provided an 
extensive set of “control” sites: the other benthic and 
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Environmental Impacts of an Open Ocean Mariculture Operation in Kona, Hawaii. Figure 6 

Typical turbidity data from monthly sampling around the Kona Blue farm site. There is no discernible difference in 
water quality between the two Control sites (upcurrent), one Effluent site (immediately downcurrent) and four Zone 
of Mixing sites (4,000 ft downcurrent) of the farm site 



fish data from the sites along the 90 miles of coastline 
on West Hawaii. A series of four transects of 25 m x 2 m 
extended parallelly to the reef crest, immediately 
shoreward of the seaward edge of the reef. Video 
footage was made of these transects, and digitized for 
selection of random points on the video frames. 

The Makako Bay-Unualoha site has been repeat¬ 
edly resurveyed since the original 2003 survey. 
Although no formal reports have been compiled, 
there have been no significant changes in benthic com¬ 
munity composition or fish populations reported. 

Biofouling on the Farm Structures There is also 
profuse growth of macro-invertebrate biofouling on 
the grid-lines and buoys of the mooring array, as well 
as on the bridle lines that attach the cages to the grid 
and the rims of the cages themselves (Fig. 7). This 
fouling includes diverse macroalgae, bivalves (several 
species of mussels and oysters: Pteria sp and Pinctada 


spp), corals (primarily Pocillopora and Porites ), 
sea urchins (primarily Echinothrix calamaris ) 
nudibranchs ( Stylocheilus longicauda ) and sponges. 
These all settle out of the plankton onto the farm 
structures, and their presence does not represent any 
significant or even measureable reduction in the 
available recruits to the nearby coral reef area. The 
growth of the corals, particularly, is compelling 
evidence that the presence of the fish farm operation 
is not deleterious to benthic organisms. 

Apart from the one brief instance of anoxic condi¬ 
tions beneath the net pens, there have been no other 
adverse impacts on benthic communities in, under¬ 
neath, or around the net pen area. 

Pests, Parasites, and Pathogens Kona Blue employs 
an integrated pest management strategy to optimize 
fish health, reduce interactions or minimize impacts 
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Environmental Impacts of an Open Ocean Mariculture 
Operation in Kona, Hawaii. Figure 7 

A Pocillopora damicornis coral colony on a mooring grid 
line around the Kona Blue net pens (within about 15 m of 
net pen stocked with fish). The presence of coral colonies 
that are highly sensitive to nutrient enrichment on 
moorings and buoys confirms that the operation has no 
significant impact on marine biota in the area 

on wild fish stocks, and reduce any potential environ¬ 
mental impacts from therapeutant use. As Seriola 
rivoliana is a new species, any therapeutant use must 
be conducted under an Investigational New Animal 
Drug (INAD) permit. INAD permits operate under 
the oversight of US Fish and Wildlife Service 
(USFWS), and the Food and Drug Administration 
(FDA), with State oversight through Office of Conser¬ 
vation and Coastal Lands (within Department of Land 
and Natural Resources) and Clean Water Branch 
(CWB, within the Department of Health). Federal 
Environmental Protection Agency (EPA) has oversight 
through the NPDES (National Pollutant Discharge 
Elimination System), which is administered by CWB. 

As with almost all farmed animals, S. rivoliana is 
subjected to small external pests - in this case, the skin 
fluke, Neobenedenia sp, that attaches itself to the fish’s 
skin. These flukes do not pose any risk to human 
health, and do not themselves detract from the quality 
of the harvested product, but may cause irritation to 
the fish. If left unchecked, the flukes can become 
a health problem for the animal, as the fish rub them¬ 
selves on the netting to ease the irritation. Kona Blue 
uses occasional treatments of dilute hydrogen peroxide 


solution (at effective dosage rates of 200-300 ppm) to 
control levels of skin flukes among the fish in the 
net pens. Hydrogen peroxide (H 2 0 2 ) breaks down 
rapidly in sunlight to form oxygen and water. Hydro¬ 
gen peroxide is also considered an acceptable Organic 
aquaculture treatment under the draff USDA 
Organic aquaculture guidelines, and USDA Organic 
agriculture standards. 

Under the permits in place at the existing site, such 
therapeutant use must demonstrate that there is no risk 
to the fish under treatment, or to the environment, or 
to human health. Monitoring of the effluent from any 
bath treatment at 100% concentration is mandated 
under the “Whole Effluent Toxicity” (WET Test) sec¬ 
tion of the NPDES permit. Results to date from the 
existing farm operation confirm that there are no sig¬ 
nificant environmental impacts from the use of the 
hydrogen peroxide. Ongoing effluent monitoring for 
WET test bioassays using larval fish (Pacific topsmelt, 
Atherinops affinis ; conducted by Nautilus Laboratories 
in San Diego) demonstrate no significant difference in 
the rates of larval fish survival between control samples 
taken 4,000 ft upcurrent of the net pen, and samples 
taken of the whole effluent (100% concentration of the 
bath treatment water) at the conclusion of the bath 
treatments. There is therefore no mechanism for any 
measureable impact on the pelagic or benthic commu¬ 
nities, or the surrounding water quality from the use of 
this therapeutant. 

In addition, monitoring of wild kahala ( Seriola 
rivoliana) stocks indicates that there is no significant 
proliferation of Neobenedenia sp. in the population 
around the farm area. Broodstock are collected period¬ 
ically from around the farm area, to replenish the wild 
stocks in Kona Blue’s hatchery. These fish are usually 
taken by commercial fishermen along the “drop off” of 
the marlin fishing grounds, about 1 mile to the South of 
the farm, and are sampled for ectoparasites upon cap¬ 
ture, by immersion in a freshwater dip. Although these 
fish are usually infested with a number of other ecto¬ 
parasites, the prevalence of Neobenedenia sp has never 
averaged much more than one individual per fish. 
By contrast, a parasitic copepod (sea lice, similar to 
Caligus ) infests wild fish at average rates of around 
ten individuals per fish, and yet is not found at all 
on the farmed fish, and does not proliferate within 
the net pens. 
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A number of innovations, either in progress or 
planned, should also further reduce the proliferation 
of Neobenedenia on fish inside the net pens. The farm is 
being reconfigured to fewer, larger Sea Station net pens. 
With a planned reduction in the number of net pens, 
a reduction in the surface area-to-volume ratio of the 
remaining net pens (from double-cone net pens to 
a more cylindrical shape), the improved surface mate¬ 
rial characteristics and rigidity of the Kikkonet™ plas¬ 
tic monofilament net mesh (which make it easier to 
clean), and the improved access for offshore crew to 
regularly clean the nets from the surface (thereby 
breaking the skin fluke life cycle by dislodging the 
adhesive eggs on the mesh), the proliferative tendencies 
of the skin fluke should be further reduced. 

Kona Blue does not use prophylactic antibiotics, 
but has, under the same regulatory oversights described 
above, and with veterinary guidance, used Florfenicol® 
to treat Streptococcus iniae infections that sometimes 
afflict juvenile fish after the stresses of transfer offshore. 
These treatments last for 10 days, and are also accom¬ 
panied by WET test water quality monitoring. These 
WET tests have repeatedly demonstrated no impact on 
marine biota. A vaccine is available for S. iniae , which 
would avoid the need for these treatments, but regula¬ 
tory requirements ironically prevent the use of the 
vaccine at this time. (The vaccine would qualify as an 
Organic treatment under draft USDA Organic aqua¬ 
culture guidelines). S. iniae infections are not an issue 
with larger fish, once they have overcome the initial 
stress of transfer from the nursery to offshore. 

Much of the concern over proliferative capacities 
for fish farm pests, parasites, or pathogens is derived 
from conflicts between salmon farming and wild 
salmon runs. Some research - though disputed - 
suggests that sea lice infestation rates can be exacer¬ 
bated by the presence of salmon farms and can then be 
detrimental to survival rates of juvenile salmon as they 
migrate past the farms to the sea. Most marine fish, 
however, are broadcast spawners. Juvenile marine fish 
are therefore dispersed over vast areas of ocean and 
reef, and do not usually have vulnerable migratory 
patterns. Given such a distinct difference in life histo¬ 
ries between salmonids and marine fish, there would 
seem to be limited applicability of the salmon and sea- 
lice research, or the concerns with impacts on vulner¬ 
able life stages, to open-ocean mariculture. 


Interactions with Wild Fish Kona Blue cultures only 
Kona Kampachi® ( Seriola rivoliana) on the offshore 
site, but the pertinent State permit also allows the 
company to possibly culture other amberjack (the 
other kahala species, S. dumerili ), mahimahi 
(Coryphaena hippurus ), and Pacific threadfin 
(Polydactylus sexifilis). 

Aggregative Effects on Wild Fish Stocks The existing 
operation does have an aggregative impact on some 
species of fish in the area, but this is considered neither 
deleterious nor significant. Fish are attracted to the site 
for a number of possible reasons: the fouling on the net 
pen, the occasional release of small quantities of 
uneaten food from the net pen during periods of strong 
currents, and the aggregative nature of objects in open 
water (as for Fish Aggregation Devices). The makeup of 
the resident and transient fish communities around the 
net pens varies over time. 

Pelagic or larger demersal fish frequently occurring 
around the Kona farm site include mackerel scad 
(opelu: Decapterus macarellus ), ulua (giant trevally, 
Caranx ignobilis ), wild kahala ( Seriola rivoliana and 
S. dumerili ), and barracuda (kaku, Sphyraena 
barracuda). Occasionally, schools of rainbow runners 
(kamanu, Elegatis bipinnulatus ) and false albacore tuna 
(kawakawa: Euthynnus alletteratus ) move through the 
net pen area. Larger pelagic fish, such as yellowfin tuna 
(ahi: Thunnus alabacares) and wahoo (ono: 

Acanthocybium solandri) are also occasionally attracted 
to the area by the baitfish, or by the net pens 
themselves. 

A number of other, smaller fish that are more nor¬ 
mally associated with coral reefs settle out of the plank¬ 
ton and assume residence either around the subsurface 
buoys or around the cages themselves. Such residents 
include schools of Sergeant majors ( Abudefduf 
abdominalis ), dascyllids ( Dascyllus albisella ), chromids 
(primarily Chromis hanui and C. ovalis) wrasses (pri¬ 
marily Coris spp and Thalassoma spp), and kyphosids 
(Kyphosus spp). As these fish are settled from the plank¬ 
ton, their presence is not considered a significant 
detraction from the biomass or diversity of the fish 
fauna on the adjacent reef. 

Escaped Fish Interaction with Wild Stocks Concerns 
about potential negative impacts of escaped fish are 
often cited as one of the reasons for objections to fish 
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farming. However, this issue is most pressing only 
where non-native fish are cultured in areas where 
escapes might become established or compete with 
local species, such as Atlantic Salmon in the Pacific 
Coast of Canada. Kona Kampachi®, by contrast, is 
native to the waters of Kona. In addition, Kona Blue 
recognizes that the innovative net pen engineering 
employed means that there is some possibility of escape 
incidents over the initial proving period and develop¬ 
ment of refinements. In consideration of this, Kona 
Blue has deliberately not applied any selective breeding 
in the hatchery, and has not used any broodstock 
beyond F2 (i.e., all broodstock are either wild-caught, 
or first- or second-generation captive-reared). There is, 
therefore, no mechanism for development of any sig¬ 
nificant difference in the genetic makeup of the fish 
inside the net pen from the fish in the wild. This 
reduces any potential impact from escapes to merely 
direct ecological impacts. 

Furthermore, the concerns with the effects of fish 
farm escapees on wild fish genetics are, again, largely 
a consequence of the conflicts between salmon-farming 
interests and wild salmon conservationists. Yet wild 
salmon stocks are unique, in that each river system or 
stream may have a genetically discrete stock from the 
adjacent watershed. Any blurring of this finer-scale 
differentiation, by interbreeding between escaped 
salmon and wild stocks, could represent a loss of 
genetic diversity. However, these concerns are not ger¬ 
mane to farming of marine fish in the open ocean. As 
marine fish are broadcast spawners, there is only 
a coarse zoogeographic genetic granularity. Tagging 
research demonstrates that Seriola and other carangids 
migrate frequently between islands in the Main 
Hawaiian Archipelago. One Seriola migrated from 
French Frigate Shoals, in the Northwestern Hawaiian 
Islands, to the Big Island - a distance of 678 miles (over 
1,000 km) over 3.6 years, at liberty [6]. The potential 
genetic impacts of Kona Kampachi® escapees on the 
wild stocks of S. rivoliana are therefore minimal. 

Those Kona Kampachi® that have escaped from the 
Kona Blue net pens - either through “leakage” as divers 
enter or leave the pen through a submerged zipper, or 
from breaches in the netting - are invariably subjected 
to very heavy predation pressure. Individual escapees 
survive outside of the zipper for usually less than 
a minute before being eaten by either the ulua or the 


bottlenose dolphins that are frequently in the area. The 
long-term prospects for survival and reproductive suc¬ 
cess of any escapees are therefore highly dubious. In 
addition, any escapees that do survive in the wild are 
presumably entering a wide-open ecological niche, due 
to the severe depletion of other deep-water species - 
such as the deep-water snappers - by commercial fish¬ 
ing. There is little likelihood of escapees competing in 
any significant manner with the few remaining wild 
snapper stocks. 

Other Wildlife Interactions 

Sharks The single overarching feature of shark inter¬ 
action with the offshore fish farm site has been - 
contrary to conventional wisdom and activist concerns 
prior to the farm deployment - the general absence of 
sharks around the net pens. For the first 8 months of 
operation, only one fleeting shark sighting occurred: 
a small tiger shark (mano: Galeocerdo cuvier). Obser¬ 
vations by farm workers suggest that there is a general 
pattern of brief influxes of tiger sharks to the area in the 
months of September and October of each year. Most 
of these animals appear individually, or in pairs, with 
a range of sizes from 8 to 15 ft in length, and they do not 
generally seem to take up residence on the farm site. 
Most tiger sharks only show interest in dead fish inside 
the net pens, and generally exhibit no interest in or 
aggression toward the farm workers. 

In the first year of operation, however, a single tiger 
shark appeared to take up residence at the farm site. As 
the animal began to show aggression toward inanimate 
objects such as surface buoys, and then farm divers, it 
was humanely dispatched. Recognizing the long-term 
unacceptability of such predator control measures, 
Kona Blue sought alternative means of addressing this 
issue. A Shark Management Plan was then developed in 
consultation with State Aquatic Resources personnel in 
Kona and other experts, which included a range of 
measured responses, and nonterminal resolution if 
animals ever again become problematic at the site. 

In subsequent years, tiger shark sightings usually 
increased in frequency at the farm site in the late- 
September early-October period. However, sharks 
were neither persistent, nor consistent. Farm opera¬ 
tions had become more adept at removing dead fish, 
and the Shark Management Plan allowed divers to 
continue to work safely. One animal - or rarely two, 
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contemporaneously - may appear at the site, and 
remain for an hour or so, before moving away, 
presenting little inconvenience to farm operations, 
and no real risk to diver safety. 

Kona Blue has also, in collaboration with DAR and 
HIMB researchers, established a receiver station on the 
farm site, as part of the larger research program for 
tracking tiger shark movements along the West Hawaii 
coastline. The first data series obtained suggested that 
the observations by the farm work crews were correct - 
that tiger sharks only very infrequently pass by the site, 
and rarely do they show any interest in the operation. 
From July 2006 to May 2007, there were a total of eight 
(8) records of tagged tiger sharks in the Kona Blue farm 
area. None of these sharks took up residence. One 
animal passed by the farm site three times in 2 months, 
another animal was recorded twice in 2 months, and 
three other animals had single records. (Fig. 8). 

Over 2008 and 2009, however, further tiger shark 
tagging trials showed that two animals appeared to 
regularly return to the farm site over periods of up to 
5 months. Two other sharks ranged over the entire 
Kona coast area, but for several weeks at a time were 
recorded exclusively from the farm site. All animals 
eventually moved on; one was later detected off Maui. 
While these results suggested that the farm site had 
become a “waypoint” for the animals over a few 


Abacus plot showing dates (blue crosses) on which five transmitter 
-equipped tiger sharks were detected at Kona fish farming cages 



2006 2007 

Environmental Impacts of an Open Ocean Mariculture 
Operation in Kona, Hawaii. Figure 8 

Frequency of tagged tiger shark occurrence at Kona Blue 
farm site: 2006-2007. Five tagged sharks were recorded 
over an 11-month period, with the most frequently 
occurring shark being present three times over a 2-month 
period. No animals took up residence, or showed any 
strong site affinity 


months, the “long-term entrainment (e.g., years) of 
tiger sharks is unlikely” [7]. 

There have also been sightings of sandbar sharks 
(mano: Carcharhinus plumbeus ) around the net pens. 
Initially, these were rare (none in the first year of 
operation), but since October, 2006, the frequency of 
sightings and number of sandbars has increased. These 
animals are usually seen in small groups (one to four 
sharks), below the net pens at depths of over 100 ft. 
They rarely rise up to the level of the net pens. Sandbar 
sharks are more secretive, and cannot readily be distin¬ 
guished by any markings. No sandbar sharks were 
caught during the tagging trials in 2008-2009 [7]. It is 
therefore unclear if these are always the same individ¬ 
uals, or if they represent a larger population of animals 
that periodically move through the area. 

In the period from June to August of 2008, there 
were a series of breaches of varying sizes in the 
Dyneema® webbing of one net pen that corresponded 
to shark bites. The same net pen was also breached in 
August 2009 by a small Galapagos shark that entered 
the net pen. The Galapagos was captured and released 
alive by company divers, unharmed except for a small 
dorsal fin notch for later identification. In each 
instance, breaches were sealed immediately on discov¬ 
ery. These incidents underscore the vulnerability of 
even sturdy Dyneema® nylon mesh, and have led to 
a plan for wholesale installation of Kikkonet® rigid 
plastic webbing across the farm. This material has 
been used in Seriola culture in Japan for over 25 years, 
and has been successfully used in crocodile and shark- 
infested waters by a sea-cage barramundi farmer in 
North Queensland, Australia. Kona Blue therefore 
anticipates that the use of Kikkonet webbing will 
reduce mesh breaches to negligible levels, and signifi¬ 
cantly reduce escapes and the attractant nature of the 
escapes to the bottlenose dolphins and sharks. 

Overall, the evidence from the Kona Blue site con¬ 
firms that there are no significant negative impacts 
from any aggregating effects of the net pens on sharks. 
The evolution of a nonterminal, humane plan for man¬ 
aging sharks on the farm site underscores the impor¬ 
tance of commercial experience to improve open-ocean 
farming practices. 

Turtles The threatened green sea turtle (Chelonia 
mydas ) is common in the nearshore waters of the 
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main Hawaiian Islands. The endangered hawksbill 
turtle (Eretmochelys imbricata ) is infrequently found 
in Hawaiian waters. The principal nesting site for the 
green turtle is in the Northwest Hawaiian Islands, on 
French Frigate Shoals [8]. No turtles have been 
observed in the area of the farm site, but it is possible 
that they occasionally transit through the site. If they 
were to do so, the taut-line mooring system and stiff- 
mesh net pens will prevent animals from becoming 
entangled. 

Seabirds The submerged net pens used by Kona Blue 
do not significantly impact seabird populations. The 
farm area itself is infrequently used as a foraging area by 
seabirds. Most seabird activity in the area is confined to 
the fishing “grounds,” which extend to the northwest of 
Keahole Point. 

Monk Seals There are four conceivable ways for 
open-ocean fish farming to have a significant negative 
impact on rare, threatened, or endangered wildlife, 
such as monk seals, dolphins, or whales. The project 
may (1) present a significant obstruction to natural 
migratory patterns, either (2) attract, or (3) repel the 
animals and thereby disrupt their normal behavior, or 
(4) the animals may become entangled in the ropes or 
mesh of the net pens or moorings. 

Monk Seals have been observed at the existing farm 
operation on two occasions, both in association with 
escape incidents from the nylon mesh nets on the 
surface nursery pens that were previously in use at 
that site. (These nylon mesh surface net pens were 
removed in 2006, as Kikkonet was, at that time, not 
yet available outside of Japan.) On each of these occa¬ 
sions, the Monk Seal was preying on the small, escaped 
Kona Kampachi®, but once the school was effectively 
eradicated by predators, the Monk Seals moved away. 
A radio tag allowed movement of one monk seal to be 
tracked from the Unualoha site to a beach on Maui the 
following day, clearly affirming that the animal did not 
take up residence, or become conditioned to the avail¬ 
ability of escapees. 

Dolphins Makako Bay, almost half a mile to the south 
of the farm site, is frequented by large schools of spin¬ 
ner dolphins (Stenella longisrostris ), on nearly a daily 
basis. These animals usually follow a diurnal pattern of 
movement from the Makalawena shelf area to the 


north, along the reef edge to the shallow areas of 
Makako Bay, where they rest for some time during 
the middle of the day. Some concerns were expressed 
during preliminary hearings about the potential for the 
farm operation to interfere with the spinner dolphin 
patterns of movement or resting habits [5]. There is no 
evidence to suggest that this has been the case. There 
have only been some occasions over the 5 years of 
operation offshore when divers or workers on the 
farm site have witnessed spinner dolphins coming any¬ 
where near the net pens. The net pens clearly do not 
impede the usual pattern of spinner dolphin movement 
toward Makako Bay; they do not attract or repel the 
animals, nor do they affect the resting pattern of the 
dolphins. 

Over the last 3 years, the existing farm operation 
has demonstrated a propensity to attract bottlenose 
dolphins (Tursiops truncatus). No bottlenose dolphins 
were previously present on the farm site, but the ani¬ 
mals have begun to appear regularly at the site since 
about October, 2006. Patterns of dolphin movement 
are best characterized as one or two animals, every day 
or so, with occasional instances of groups of up to seven 
or eight animals. There is no regularity to the animals’ 
appearance on the farm site: they may be present all 
day, or only in the morning, or only in the afternoon. 

Kona Blue staff monitor and report on dolphin 
activity to HIHWNMS and NOAA’s PIRO PRD. The 
bottlenose dolphins are probably attracted to the farm 
site by a combination of (1) the presence of the mid¬ 
water structures acting as a Fish Aggregating Device 
and the associated fish community that is present 
around the net pens, (2) the occasional provisioning 
from “leakage” escapes when divers enter or exit a net 
pen, and from the rare larger escape incidents when 
predators have breached the Dyneema nylon webbing, 
and (3) interaction with divers outside of the net pen, 
as the divers move about the farm from boat to net pen 
and back. 

One individual dolphin has taken up residence over 
2009 and 2010. This animal was suffering from a large 
fishing hook and leader line that had become lodged in 
its jaw, and it was present on the farm site almost 
continuously during this period. For many months, 
the dolphin was lethargic and lost weight, but more 
recently (as of late 2009) has appeared to be more active 
and in better condition [9]. The aggregative effective of 
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the net pens for this one animal might therefore be 
interpreted as beneficial. 

No other individual bottlenose dolphin has taken 
up permanent residence at the farm site. There are no 
other animals present on the farm site on around one- 
quarter to one-third of days. Even when other animals 
are present, they are often only there for part of the day, 
rather than the entire day. In October-November, 
2008, for example, dolphins were present for some 
period of time on 22 days out of 34 days [10]. There 
were dolphins present at the farm site for some or all of 
the day on 65% of the days. On 35% of days, there were 
no dolphins reported as observed on the site. Only on 
1 day were six dolphins present. Most other days, there 
were one or two animals present for some portion of 
the day. 

Other dolphin species may be found in and around 
the proposed farm lease area, but they are usually most 
commonly seen on the “grounds” to the south of the 
site. Spotted dolphins ( Stenella attenuata ), rough¬ 
toothed dolphins ( Steno bredanensis) , and false killer 
whales ( Pseudorca crassidens ) have all been observed on 
the “grounds,” or in other offshore waters of the Kona 
Coast, but have not been reported from the farm site. 

In summation, although there has been behavior 
modification in one compromised individual, the pres¬ 
ence of the farm operation has not had a significant 
negative impact on dolphin behavior. The overall long¬ 
term impact on dolphins from the farm operation will 
probably be further reduced. Modifications to net pens 
currently under way should help to alleviate the attrac¬ 
tive nature of the farm to the dolphins, by reducing the 
potential for escapes through mesh breaches, and for 
leakage escapes, and by reducing the amount of time 
that divers need to operate outside of the net pens. 
Kona Blue will continue with the ongoing monitoring 
and reporting of marine mammal activity around the 
farm site, and continues to collaborate in this with 
HIHWNMS staff, PIRO PRD staff. 

Humpback Whales Populations of the endangered 
humpback whale ( Megaptera novaeangliae) winter in 
the Hawaiian Islands, and the project site lies around 
1 mile inside the southernmost boundary of the Hawai¬ 
ian Islands Humpback Whale National Marine Sanctu¬ 
ary (HIHWNMS). Humpbacks are known to frequent 
the entire Kona coast area in winter. The whales move 


throughout the general area, usually following a 
longshore track (north to south, or vice versa). 

Concerns about the reduction in whale habitat by 
the existing project were previously expressed by 
HIHWNMS and DLNR/DAR officials. Some concerns 
were also earlier expressed with the potential for entan¬ 
glement of whales in the mooring lines of the net pens. 
A comprehensive analysis of available records of whale 
entanglement (NMFS Stock Assessments), a review of 
interactions between marine mammals and Hawaii’s 
fisheries [11], and details of marine mammal 
strandings compiled by NMFS Pacific Area Office 
(NMFS-PAO) shows that most whale entanglement 
events occur in slack net mesh (such as drift nets or 
fish weirs), slack vertical lines (such as crab pot or 
lobster pot floats), or surface lines (such as long-lining 
gear). Among all these observations, there is no record 
from any US aquaculture operation of entanglement of 
humpback whales, or other marine mammals, in the 
taut moorings or net panels of fish net pens. With heavy 
mooring gear, and taut lines and mesh, the potential for 
entanglement is considered negligible [12, 13]. 

Furthermore, it appears that the waters in the vicin¬ 
ity of Keahole Point are not as heavily frequented by the 
whales as are other waters of the Sanctuary, further to 
the north (Fig. 9). Observations from workers at the 
farm site suggest that the farm does not interfere with 
the movement of the humpback whales, beyond the 
immediate and obvious exclusion from the waters 
inside the net pens. The distance of around half-mile 
from the inshore side of the net pens to the shoreline 
offers ample room for the whales to move around the 
eastern end of the farm structures, without any chance 
for any funneling or bottleneck effects. 

There is no definitive pattern of whales avoiding, or 
being attracted to the cages. Whales are occasionally 
seen within the lease area. On one instance, the farm 
workers witnessed a humpback on the surface inside 
the mooring grid array; the animal appeared to nego¬ 
tiate its path between the net pens and mooring lines 
with ease. 

As part of the company’s Marine Mammal Moni¬ 
toring Plan (MMMP), farm workers provide data for 
assessing whale abundance and patterns of movement 
around the farm site. The MMMP describes Federal 
recommendations or instructions in the unlikely event 
of any entanglement, and also details ongoing 
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Environmental Impacts of an Open Ocean Mariculture Operation in Kona, Hawaii. Figure 9 

Typical Humpback whale sighting patterns around the Big Island of Hawaii 


reporting requirements for any close interaction with 
humpback whales, or any physical interaction between 
the farm array and other marine mammals. 

Recreational Use Impacts The farm site lies offshore 
from the Natural Energy Laboratory of Hawaii Authority 
and the Kona International Airport, and as such, has 
little effect on shore-based recreation. The heavily used 
public recreation area of Kekaha Kai State Park 
(Mahai’ula) lies more than 3 miles further to the north. 

A survey of recreational activity in the general area, 
north of Keahole Point was conducted prior to the farm 
installation, from August to September, 2001, in con¬ 
junction with the original farm site environmental 
assessment [14]. The survey covered 2 months of sum¬ 
mer conditions, which was considered the best means 


of ensuring that the data represented the heaviest use of 
the area. The overarching finding of the survey was that 
the area is only used for transit: of the 150 observations 
made over the 61 consecutive days of the survey, only 
one boat was seen within the farm site - a boat 
transiting through the area. Most activity in the general 
Keahole-to-Unualoha area was recreational dive boats 
and commercial dive tour operations along the reef and 
shoreline south of Unualoha Point (directly inshore 
from the proposed farm site), and in Makako Bay itself. 

Observations by the Kona Blue staff on the farm site 
suggest that this trend continues - the use of the farm 
lease area is merely for transit. Fishing boats now occa¬ 
sionally troll lines close to the central area, to try to take 
advantage of the aggregative effects of the net pens. 
There are no records of catch rates around the farm, 
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but anecdotal evidence indicates that catches are 
primarily ono (wahoo, Acanthocybium solandri ), with 
infrequent catches of ahi (yellow-tuna, Thunnus 
alalunga). 

Kona Blue’s permit allows restricted public activi¬ 
ties in the lease area, precluding anchoring, scuba 
diving, spearfishing, or swimming within the 90 acres. 
These limits are considered the minimum needed to 
protect the company’s investment, to limit their liabil¬ 
ity (and retain insurance coverage), and to assure 
public safety. Fishing by the public from unanchored 
boats (trolling, or linefishing from drifting boats) is still 
permitted, but with the caveat that any fishing lines 
that become entangled in the net pen mooring lines 
must be left in place and cannot be retrieved by divers. 
The company also requests that fishermen not troll 
through the center of the farm site because of the 
potential for fishing lines to entangle divers, or for 
lures to hook into mooring lines or nets. Boats 
transiting the net pen area are also requested to observe 
a slow “no-wake” boat-speed to maximize safety for 
divers. Unguided recreational scuba diving or 
unauthorized commercial scuba dive tours are not 
permitted within the lease area because of liability, 
safety, and security concerns. 

The loss of access to recreational activities within 
this relatively small area of ocean space is not consid¬ 
ered significant. Kona Blue’s ongoing observations 
affirm that there is virtually no fishing or other recre¬ 
ational use of the lease area, or the areas adjacent to the 
lease area, beyond trolling, which is probably enhanced 
by the farm’s presence. 

Viewplane Aesthetics Community value judgments 
and perceptions of how the oceans should be used 
largely govern the impact of the project on the 
community’s aesthetic enjoyment of the area. In 
community meetings, Kona Blue generally enjoys 
strong support for the broad goals of the company. 
There is wide recognition of the severely depleted 
status of bottom fish species in Hawaii. The 
awareness of the global fisheries crisis has recently 
been amplified by several scientific studies, such as 
that of Worm et al. [15], which projected a collapse 
of world fish stocks by 2048, unless significant remedial 
changes are made to fisheries and marine ecosystem 
management. 


The visual impact of the project is minor, compared 
with the adjoining properties of Kona International 
Airport and the aquaculture operations at the Natural 
Energy Laboratory of Hawaii Authority (NELHA). The 
major visual impact from the farm operation is from 
the experimental surface pens and the feed barge. There 
is also the additional presence of work and dive boats, 
and harvest boats, on some days. However, the impacts 
of these structures and activities are not significant, 
given the distance from the nearest residences, more 
than 3 miles away. 

There is general community acceptance that the 
project fits in well with the overall ambience of inno¬ 
vative aquaculture at NELHA, and the need for Kona 
to develop alternative industries beyond tourism. 
Fisherfolk and other mariners recognize the validity 
of the criteria that Kona Blue has used to select this site 
(c.f. deeper or shallower sites), and have not expressed 
a strong preference for the project to be located 
elsewhere. Applicants for farm permits in other areas 
of the Kona Coast (around Kawaihae) have, on occa¬ 
sion, been told that their project would more appro¬ 
priately be located “down near NELHA and Kona 
Blue.” 

Cultural Resources , Practices , and Mechanisms for 
Impact Prior to the 1801 lava flow that inundated 
the area, Keahole was the site of the largest fish pond 
in the Hawaiian islands. The Pai’ea pond (reputedly 
King Kamehameha’s favorite pond) was approximately 
3 miles long and 1/2 mile wide; canoes were used to 
traverse from one side to the other. The farm site is 
directly offshore from where Pai’ea once stood. Fish 
farming could therefore be considered historically and 
traditionally appropriate to the area. 

The farm lease area is over a mile from the tradi¬ 
tional marlin, tuna, and wahoo fishing grounds 
(Fig. 10). The site is too deep for free-diving or scuba 
diving activity but suitable for “blue-water” spearfishing. 
Usually, however, blue-water spearfishing is practiced 
close to a point or drop-off, rather than over bare sand 
substrate around 200 ft deep. There are no significant 
benthic plant or animal populations in the farm lease 
area, and there are virtually no benthic or pelagic fishing 
activities in this depth range. Kona crabs and nabeta 
(.Xyrichtys pavo) are the only benthic resources that 
occur on sand bottom at this depth, but informants 
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Environmental Impacts of an Open Ocean Mariculture Operation in Kona, Hawaii. Figure 10 

Kona Blue's offshore fish farm site in relation to primary fishing areas. The farm site is well inside of both the 100 fathom 
(200 m) trolling ledge along the "grounds" offshore of Keahole Point and the 40 fathom (80 m) ono lane. Reef fishing 
and opelu ko'a are found well inshore of the proposed site, along the edge of the reef, in waters up to 120 ft deep (40 m). 
Fishing grounds for opelu at night are usually deeper than 40 fathoms (80 m) 


suggest that the currents are too strong for any signif¬ 
icant fishing effort this close to Keahole Point [16]. 

The only potentially impacted cultural resource 
that was cited during extensive discussions with com¬ 
munity and kupuna (elder) groups for the original 
farm site was the several opelu ko’a (“holes” or school¬ 
ing places for mackerel scad - Decapterus macarellus ) 
that occur in the general region. The locations of these 
ko’a are considered to be part of traditional marine 


lore, and are considered inappropriate for publication, 
or for sharing outside of the families or community 
groups who have traditionally fished these ko’a. How¬ 
ever, in private meetings with the most knowledgeable 
kupuna, the locations of the traditional opelu ko’a were 
determined to be outside of the proposed project loca¬ 
tion [17]. Opelu aggregations usually occur in water 
around 120 ft deep, close to reef drop-offs, and well 
shoreward of the farm area. 
















3574 


E 


Environmental Impacts of an Open Ocean Mariculture Operation in Kona, Hawaii 


Access to, or practice of any other customary activ¬ 
ities has not been significantly constrained by the farm 
array or operations. The exclusive control over the 
waters (and the fish) inside the net pens is consistent 
with traditional and cultural practices that identified 
fish traps or lobster traps - and the animals therein - as 
the private property of the trap owner. The same 
principles apply here. 

Global Impacts: Fish Meal and Fish Oil Usage 

Fish Meal and Fish Oil Usage 

Fish such as Seriola rivoliana (Kona Kampachi®) usu¬ 
ally feed toward the top of the trophic chain in the wild. 
They therefore possess digestive systems and nutri¬ 
tional requirements that are adapted for feeds with 
high protein and lipid levels, and low levels of 
carbohydrates. 

Fish meal and fish oil usage in fish feeds can be 
considered a valid use of a natural, sustainable, renew¬ 
able resource, so long as the fishery from where the fish 
meal and fish is sourced is responsibly managed. 
Although stocks such as the Peruvian anchovetta fish¬ 
ery are sustainable in the sense that they are very well 
managed, they are not scalable. If mariculture is going 
to fulfill its potential for increasing seafood consump¬ 
tion to meet growing demands, then some alternative 
sources of proteins and oils will be required. 

Kona Blue has therefore been focused on reducing 
the inclusion rate of fish meal and fish oil, such as 
Peruvian anchovies, from targeted reduction fisheries 
and increasing the use of agricultural oils and proteins, 
such as soy, canola, wheat, corn and poultry meal, 
and oil. 

Improving Feed Conversion Efficiencies: An Evolu¬ 
tionary Approach Though efficient use of fish meal 
and fish oil from targeted reduction fisheries is both 
rational and justifiable, this by no means suggests that 
these resources are unlimited, or that alternatives 
should not be searched for. If open-ocean mariculture 
is to develop into a food production system that can 
provide a significant proportion of the nutritional 
needs of a growing planet, then additional sources of 
sustainable proteins and oils for feedstuff must be 
found for this industry. The arc of Kona Blue’s feed 
development strategies is perhaps instructive of 


directions that open-ocean mariculture, as a global 
industry, might follow to achieve such scalable 
sustainability. 

Initially, Kona Blue Water Farms fed the Kona 
Kampachi® with a diet that was considered “organic” 
by European standards. At the time, USDA did not have 
(and still does not have) Organic standards for aqua¬ 
culture feeds. In the EU, however, Organic fish food 
was considered to be that which was most similar to the 
animal’s diet in the wild. This feed, therefore comprised 
largely comprised fish meal and fish oil derived from 
Peruvian anchovies. 

With the recognition of the need for more scalable 
feedstuff alternatives, however, Kona Blue worked with 
the feed vendor to develop a new diet that lowered the 
inclusion rate of fish meal and fish oil from Peruvian 
anchovies to a combined total of 50%. This diet 
included soybean meal, wheat gluten, canola, and 
other grain proteins and oils. The biological efficiency 
for this diet, however, was still suboptimal, with a fish- 
in:fish-out ratio (FIFO) of over 2:1. (i.e., an input of 
more than 2 lb of anchovies for each pound of Kona 
Kampachi® produced). 

The inclusion rate of agricultural proteins in diets 
for marine piscivorous fish is limited by the presence of 
a range of “anti-nutritional factors” in the grains and 
less-purified meals. (Aside: although often described as 
carnivorous, most marine fish such as groupers, snap¬ 
pers, jacks, and bream, are perhaps more accurately 
described as “carbohydrate intolerant.” They require 
diets that are high in protein and lipid, and low in 
carbohydrate. There is no specific nutritional require¬ 
ment that these fish eat meat). For this reason, soybean 
meal is restricted to about 20% of the diet for most 
marine fish. To reduce the fish meal and fish oil inclu¬ 
sion rate further, and to further lower the FIFO, would 
therefore require proteins and oils from other sources. 
By-products from both edible fishery processing and 
poultry processing were therefore included in the 
revised Kona Blue diet, allowing the Peruvian anchovy 
inclusion rate to be further reduced to 30% of the 
ration: 20% fish meal and 10% fish oil. 

Inclusion of poultry processing by-products, how¬ 
ever, meant that some customers, such as Whole Foods 
Markets (WFM), a high-end organic and natural foods 
retailer, would no longer carry Kona Kampachi®, even 
if the poultry used for the by-products was of Organic 
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origin. This position by WFM was out of consideration 
for those of their customers that were vegetarians, but 
still wanted to eat fish. WFM asserted that these cus¬ 
tomers would not want to eat that fish if fish had eaten 
a pellet that contained proteins or oils that were derived 
from mammals or birds. Kona Blue appealed to WFM 
to review their position, given the importance of reduc¬ 
ing mankind’s global footprint on the oceans, that is, 
the reliance of humans on natural marine resources - 
but as of 2010, there has been no change in this policy. 

Kona Blue has recently tested two diets that 
completely eliminate from the Kona Kampachi® diet 
any fish meal and fish oil sourced from targeted 
reduction fisheries, and any land animal processing 
by-products. These innovative diets use processed by¬ 
products from sustainably managed fisheries intended 
for human consumption. As the trimmings from these 
sources would otherwise have been discarded, used as 
fertilizer, or burnt as fuel, the use of these fish meal and 
fish oil products in the Kona Kampachi® diet represents 
an ideal reuse of natural resources. These diets there¬ 
fore would result in a zero FIFO ratio, that is, no 
targeted reduction fishery by-products included in the 
diet of the end product. 

Alternative Feedstuffs for Open-Ocean Mariculture 

Kona Blue is involved in testing a range of alternative 
feedstuffs for Kona Kampachi® diets, which also offer 
potential for other species of marine fish. Alternative 
soy products, other agricultural grain concentrates, 
yeast, and other single cell proteins, edible fishery 
by-products and - more recently, with the boom in 
microalgae culture for biodiesel production - defatted 
microalgae by-products, have all either been tested, or 
are under development for Kona Kampachi® feed trials. 

Kona Blue has tested a range of soy-based diets, 
with soy protein concentrates and omega-3 oil rich 
strains of soybeans. These trials suggest that the inclu¬ 
sion rate of soy protein concentrates cannot, by itself, 
exceed the same 20% threshold that limits soybean 
meal. Above this level, growth rates and feed conver¬ 
sion ratios are depressed. With the inclusion of taurine 
in the formula, however, soy protein concentrates 
could replace fish meal as the source of protein up to 
40% of the diet with no detriment to fish growth rates 
or feeding efficiencies. 


There is a diverse array of edible fishery processing 
by-products that are available for use in aquaculture 
diets, and this direction offers tremendous potential for 
further development. The processing by-products from 
most wild salmon runs, for example, are woefully 
underutilized, and are often disposed of directly back 
into the rivers from which the fish are taken. Logistical 
and economic constraints limit the use of these trim¬ 
mings, however, as the processing plants are usually 
small and isolated, the salmon runs are only of short 
duration, and storage and transport of fish meal or fish 
oil by-products from these villages to reduction facili¬ 
ties and feed mills is a challenge. Development of fish 
silage systems offers one potential, partial solution. 

However, even the less-seasonal, larger-scale 
processing of farmed salmon in more centralized plants 
presents difficulties for the utilization of by-products. 
For biosecurity reasons, most fish feed plants will not 
run salmon-derived feed stuffs through their machin¬ 
ery because of the potential for contamination of feeds 
from viruses, bacteria, or other pathogenic vectors that 
may be found in the by-products. Screening for known 
pathogens is not an adequate solution: even though the 
chance could be considered very slim that some 
unknown pathogen may be unwittingly dispersed via 
extruded feed, the potential catastrophic consequences 
of such widespread and rapid disease dissemination are 
sufficient to ensure that no such chance be offered. This 
therefore excludes almost all large feed mills in salmon¬ 
farming regions from using salmon by-products. 

Similar inefficiencies are found in the reuse of 
trimmings from the pollock fishery (Theragra 
chalcogramma ) in the northern Pacific. This fishery 
primarily processes most of the catch at sea, into 
surimi. Trimmings from these fish constitute around 
65% of the wet weight of the catch. For a fishery that 
has averaged around 1.3 million metric tonnes, this 
then represents around 850,000 tonnes of wet weight 
by-product annually that could be converted into fish 
meal and fish oil. For many years, much of this by¬ 
product was discarded back into the ocean, or the 
rendered fish oil was burnt in the diesel generators of 
the processing vessels. Some 8 million gallons of fish oil 
in Alaska is largely disposed of as biodiesel [18]. More 
recently, some proportion of these trimmings have 
been used to make a high-quality white fish meal that 
is largely exported to Asia, where it is valued in feeds for 
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farmed eels. However, the proportion of by-product 
that is reused or recycled is not reported. Again, eco¬ 
nomics and logistics conspire against development of 
a rational supplement to targeted reduction fisheries. 
The increasing prices of fish meal and fish oil, however, 
driven by the growing demand for animal feeds from 
developing economies (notably China and India) may 
be a greater incentive to resolve these constraints. 
Edible fishery by-products may yet play a significant 
role in aquacultue feedstuff sourcing. 

There is ample evidence that some or all of these 
innovative feedstuffs could help to reduce the demand 
for fish meal and fish oil from clupeids in the medium - 
to long-term. 

Is Aquaculture “Fishing down the Food Chain”? 

Much interest has been recently focused on the prob¬ 
lem of “fishing down the food chain,” beginning with 
Pauly et al.[19], and Taylor et al.[20]. This is the trend 
over time for commercial fisheries - driven by serial 
stock depletion - to shift their target species to those 
lower on the trophic pyramid. Fishermen first start out 
exploiting the high-value, top-end predators, then 
move on to mid-level predators, and then down toward 
herbivores and detritivores - what was previously con¬ 
sidered bycatch. Fisheries generally start out targeting 
the larger, sweeter-tasting species - tunas, snappers, 
groupers, and such. As these become increasingly 
scarce, fishermen apply greater fishing power, and fish 
longer and deeper, retaining or targeting what was 
previously considered “trash.” The argument portends 
that at some stage, the food web is reduced to an ocean 
full of jellyfish. “Fishing down the food chain” is 
a condemnation of the inherent unsustainability of 
most commercial fisheries management - or rather, 
mismanagement. 

Fish farming has somehow been implicated in this 
practice on the basis of farmed fish being fed pellets 
that are partly made up of fish meal and fish oil derived 
from anchovies, menhaden, sardines, or the like. These 
fish (collectively, the clupeiforms) usually form the first 
step in the ocean food chain beyond primary produc¬ 
tion. Some scientists and anti-aquaculture advocates 
misconstrue or deliberately misinterpret the complex¬ 
ities of ecological and economic cause-and-effect, and 
represent the use of clupeiforms as feed for farmed fish 


as wanton. This has been led by respected institutions 
such as the Monterey Bay Aquarium [21, 22], but has 
also spilled into mainstream media, such as the NY 
Times [23], Conservation Magazines article on “10 
Solutions to Save the Ocean” [24], The Ecologist [25], 
and The Economist [26] . The notion that aquaculture is 
guilty of “fishing down the food chain” is now lodged 
within the public consciousness. 

The bottom of the food chain, however, is where 
fishing should preferentially be done. It makes far more 
sense to use herbivores or planktivores from the base of 
the trophic pyramid as either human food or feed for 
farmed fish, than to be targeting top-end predators. 
This makes economic sense, but it also makes sense 
from other perspectives: it is better for the oceans 
ecosystems, it is better from the viewpoint of bioener¬ 
getics transfer through the trophic pyramid, it is better 
for consumer health, and it makes for better fisheries 
management. 

The economics are simple: Peruvian anchovies and 
menhaden are not highly valued in the market, so they 
are cheap. Maybe this will change in time, and prices 
for anchovies and sardines will increase, as more people 
develop a taste for oily baitfish, but it is more likely that 
most consumers will still prefer larger piscivorous 
marine fish as sashimi or fillets. 

The oceans ecosystem offers several reasons why 
the bottom of the trophic pyramid is a better place 
for humans to extract nutrition from the sea. It is, 
most simply, a matter of mass and mathematics. Her¬ 
bivorous fish are more abundant, with greater biomass. 
Catching 1,0001 of Peruvian anchovies has little impact 
on the 6 million ton spawning biomass (around 
0.025%). By contrast, 1,0001 of tuna represents around 
10% of the Bluefin Tuna spawning stock in the Western 
Atlantic, the biomass of which is currently estimated at 
less than 10,000 t [27]. 

Moreover, Clupeiforms are classic “r-selected” spe¬ 
cies, with their smaller body size, faster maturing, and 
shorter life spans [28]. They are highly opportunistic: 
a decrease in population size in Peruvian anchovies 
often results in increased recruitment from the next 
spawning. From an ecological perspective, these species 
are precisely where fishing effort should be targeted, 
not the larger, more vulnerable, slower-growing 
“K-selected” species at the top of the food chain. In 
agricultural terms, most of the crops that humans raise 
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are strongly cc r-selected” - wheat, corn, barley, rice, 
while targeting a “K-selected” species in agriculture 
might be the equivalent of chopping down oak trees 
to eat the acorns. 

Herbivorous clupeiforms also grow and reproduce 
faster. Menhaden stock resilience to fishing pressure is 
“high” [29], with a population doubling time of only 
15 months. Northern Bluefin Tuna, by contrast, have 
“low” stock resilience, and a minimum population 
doubling time of 4.5-14 years [30]. Therfore, if half 
the menhaden were harvested, it would take 15 months 
for the stock to recover. However, if half the tuna 
population was taken, it would take, at a minimum, 
between 4.5 and 14 years to recover. Southern bluefin 
tuna also do not begin to spawn until they are perhaps 
11 years old, and may live to “at least 40 years of age” 
[30]. However, Peruvian anchovies are sexually mature 
within 1 year, and only live for around 3 years. The 3- 
year old anchovies then die and fall to the ocean floor. 

The public health imperative should also provide 
impetus to source fish meal and fish oil from lower 
down the food chain. Menhaden and anchovies filter 
algae and zooplankton directly from the water. They 
are therefore high in heart-healthy omega-3 oils, yet 
low in the persistent organic pollutants, such as mer¬ 
cury and PCBs. These pollutants, however, are concen¬ 
trated as they move further up the food chain. It is 
primarily top-level predators - sharks and tuna - that 
are on FDA advisories for pregnant and nursing 
mothers, and children. By contrast, an aquaculture 
species that can achieve a feed conversion efficiency of 
close to 1:1 (FIFO, or Fish In : Fish Out) contain 
essentially the same contaminant loading as the 
clupeiforms at the base of the food chain. 

Clupeiform fisheries are also more readily man¬ 
aged, with relatively simple stock dynamics and ecosys¬ 
tem interactions. The major inputs to clupeiform 
stocks are the spawning biomass and primary produc¬ 
tivity, which is usually driven by the strength of the 
nutrient-rich upwelling. Most of the fisheries occur 
within the EEZ of a single nation, where there are direct 
incentives for sound management and enforcement, 
and where access can be regulated. Tuna and swordfish, 
by contrast, are highly migratory species. Donut-holes 
of high-seas waters, beyond any country’s 200-mile 
zone, provide opportunities for distant-water fishing 
nations to concentrate their boats and effort. Attempts 


at managing tuna stocks are typified by the Interna¬ 
tional Commission for the Conservation of Atlantic 
Tunas (ICCAT), which has 46 members, and almost 
no enforcement capabilities. And while Hawaii’s long- 
line fishery targeting big-eye tuna may be very well 
managed, for example, heavily subsidized European 
or Asian purse-seiner fleets target the juveniles of the 
same stock in the South-Western Pacific. 

Moreover, the carbon footprint of clupeiform fish¬ 
eries is minimal. These fish are usually taken by purse- 
seiners, working close to the coast, encircling schools 
containing hundreds of tons at a time. The carbon 
footprint for species higher on the foodchain is much 
higher, with fish being caught by diesel-powered trawlers 
or trailers, or - for bluefin tuna and swordfish - by 
harpooning the fish, one at a time. 

Most importantly, however, it is far better from 
a bioenergetic perspective to target fish closer to the 
bottom of the food chain. Applying the 10% trophic 
transfer rule means that the 1 lb of wild tuna sashimi on 
a consumer’s plate needed to eat 10 lb of anchovies - or 
its equivalent in fish meal and fish oil. Or maybe, if 
there were two steps in the trophic pyramid, each 
pound of wild tuna required 100 lb of anchovies to 
first be converted into 10 lb of mackerel. 

Aquaculture, by contrast, is always a single step - 
clupeiforms-to-crop. But aquaculture can also use 
alternative agricultural proteins and oils, such as corn 
and wheat gluten, soy proteins and oils, canola, and 
other animal processing by-products. These other pro¬ 
teins and oils reduce the fish meal and fish oil inputs to 
the extent that some of the purported “carnivores” can 
thrive on a diet that is around 20% fish meal and fish 
oil. On the “sustainability quotient” - the number of 
pounds of fish-in to produce 1 lb of fish-out (the FIFO) 
can then attain the perfectly efficient goal of parity, or 
1:1 (i.e., every pound of Peruvian anchovies in the 
farmed fish diet produces 1 lb of product). The result 
is efficient conversion of a low-value anchovy into 
a high-value marine fish, without disrupting the fragile 
top of the trophic pyramid. 

Larger, wild fish are more bioenergetically wanton. 
Wild fish lose energy through inefficient digestion, in 
hunting prey, trying to avoid predation, spawning, and 
succumbing to natural mortality. As wild fish grow larger, 
they also become increasingly inefficient - a greater pro¬ 
portion of energy is needed to maintain the animal’s 
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metabolism. Any bycatch will compound these ineffi¬ 
ciencies further. The global bycatch ratio is around 
0.28 lb of discard for every pound of target species. 

Earlier estimates [20] suggested that farmed fish 
might be more efficient than wild fish, based on a 
single trophic step, by a factor ranging from 2 to 5. 
Combining the life cycle inefficiencies, trophic ineffi¬ 
ciencies, and bycatch inefficiencies of wild fish, how¬ 
ever, means that farmed fish may be more efficient than 
wild fish by a factor of around 60 (Table 1, and [31]). 


This reasoning does not advocate for greater fishing 
effort on anchovies. To the contrary - caution is called 
for. While most of these stocks are sustainably managed 
at current levels, they could not withstand any greater 
pressure. These clupeiform stocks should continue 
to be very closely monitored and highly regulated. 
Large marine protected areas should also be established 
to allow some clupeiform-based ecosystems to flourish 
in their natural state (rather than attempting ecosystem 
based management). But it is imperative that the 


Environmental Impacts of an Open Ocean Mariculture Operation in Kona, Hawaii. Table 1 Relative ecological 
efficiencies of farmed and Wild-Caught fish. The table shows the compounded cost in terms of anchovy-equivalents for 
farmed and wild-caught fish. Low-end estimates and high-end estimates are provided for each type offish and compared 
cross-ways to obtain a lowest relative rate and highest relative rate 



Farmed fish 

Wild-Caught fish 

Global mean 


Low-end 

estimate 

High-end 

estimate 

Low-end 

estimate 

High-end 

estimate 

Ratio of wild to 
farmed 

Life cycle efficiency 3 

1 

1 

3 

10 

6 

Trophic transfer 
efficiency 6 

1 

8 

10 

100 

7.3 C 

"Bycatch" efficiency 

1 

1 

-| (4) 

n e 

1.3 d 

Compounded "cost" 

1 

8 

30 

11,000 

57 


The lowest relative rate extrapolated from this table is that the least-sustainably farmed fish are around 4x more ecologically efficient than 
the most sustainably harvested wild fish (i.e., 30:8). The highest relative rate is that the most sustainably farmed fish could be 11,000x 
more ecologically efficient than the least-sustainably harvested wild fish (i.e., 11,000:1). The Global Mean of wild fish efficiency to farmed 
fish efficiency is around 57x 

a There are no published estimates of the relative life cycle efficiencies of farmed versus wild fish. However, fish that reach reproductive age 
in captivity can see Feed Conversion Ratios increase by factors of 5 or 10 over juvenile and sub-adult fish. Natural mortality and the 
nutritional cost of maintenance of basal metabolic processes during periods of food depravation also increase the "Economic" Feed 
Conversion Ratio for wild fish populations 

b ln 1997, food conversion efficiencies (FCE) for farmed marine fish and farmed salmon were around 5:1 and 3:1, respectively (Naylor et al. 
2000). By 2010, however, FCEs are projected to reach 1.5:1 for farmed marine fish, and as low as 1.2:1 for farmed salmon (Tacon 2005). Kona 
Blue has been able to culture Kona Kampachi® on a diet that equates to a 1:1 ratio of wet-fish-in to wet-fish-out. However, if a less- 
sustainably farmed fish is fed a pellet high in fish meal and fish oil (say, to meet the Scottish Soils Association's Organic standards, with 
around 80% fish meal and fish oil), this diet could equate to around 4 lb of wet anchovy-equivalents for every 1 lb of dry pellet (a wet-fish 
to fish-meal ratio of 5:1 is considered standard). On this diet, most commercially farmed species might have food conversion ratios of 
around 2:1 (dry pellet to wet fish), implying an FCE of 8 lb of wet-fish-in for every 1 lb of wet-fish-out 

c Tacon's (ibid) estimate of FCEs for farmed salmon and farmed marine fish might be conservatively pooled at, say, 1.5:1, that is, 1.5 lb of 
anchovy-equivalents for every pound of farmed fish produced worldwide. There is a differential of around 1.1 trophic levels between 
global fishery landings (with a mean trophic level of around 3.3) and the Peruvian anchovetta fishery (with a trophic level of around 2.2: 
Pauly et al, [19]). At a presumed 10% biomass transfer efficiency up each trophic level, this implies 111b of anchovy-equivalents to produce 
a pound of harvested wild fish. The median ratio of wild to farmed trophic transfer efficiencies can therefore be estimated at 11:1.5, or 7.3:1 
overall 

d Harrington et al. (2005), report a "nationwide discard-to-landings ratio of 0.28" (i.e., for 3.7 million tons landed, some 1.06 million tons 
were discarded). However, for highly selective fishing methods, such as harpooning, bycatch is effectively zero, as for farmed fish 
e For finfish, the ratio of bycatch to target fish (in the Northern Pacific) can be as high as 11:1 because the bycatch is either too young, out of 
season, or the vessel has no permit to keep it (Alverson 1998) 
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fisheries at the base of the food chain be better 
managed, and environmentally sound aquaculture 
endorsed so that pressure can be taken safely off the 
top of the food chain. 

Fishing at the bottom of the food chain should 
therefore be encouraged preferentially over any other 
kind of fishing. This is not a function of recent 
overfishing: even 100 years ago, this principle would 
have still held true. People always should have been 
fishing at the bottom of the food chain. To continue 
to accuse aquaculture as being part of the problem of 
“fishing down the food chain” is therefore disingenu¬ 
ous. Aquaculture is an important part of the solution to 
the feeding of the growing humanity. To assert other¬ 
wise confuses the consumer and discourages the policy 
shifts needed toward more sustainable aquaculture, 
and healthier oceans. 

Future Directions 

The Challenge 

Open-ocean mariculture must expand, but it must 
expand in an environmentally responsible manner. 
While opponents may cite the precautionary principle 
as reason to not move forward, or to do so only cau¬ 
tiously, with experimental site permits or restrictive 
legislation, there is an imperative for action. Extractive 
pressures cannot be increased on already depleted wild 
fishery resources, and the public health costs of limited 
seafood consumption cannot be accepted any more. 
These two trends must be turned around by finding 
alternative sources of healthful seafood. Open-ocean 
mariculture is the only practical means of achieving 
this. To urge inaction, then, is in effect advocating for 
either a greater fishing pressure on wild stocks, or the 
increasing human mortality and suffering from heart 
disease and stroke. If the consequences of inaction are 
so inevitable and so severe, and the potential conse¬ 
quences of action are only slight and temporary, then 
the precautionary principle insists that action must be 
taken. Therefore, the requisite regulatory framework 
for industry growth must expediently be set in place 
and the needed technologies and support industries 
developed to proceed forward. 

The single greatest challenge will remain that of 
effecting a change in the mind-set of the environmental 
community to the extent that there is broad acceptance 


of the need for expansion of open-ocean mariculture 
as a conservation tool, that is, as an alternative to 
targeting larger wild fish. Until this is accomplished, 
then, this industry will continue to be smeared with 
irrational fears, community prejudice and consumer 
bias, and the promise it offers will be left woefully 
unfulfilled. 

Further! Deeper! 

The Kona Blue operations to date have produced 
annual harvests of around 500 T, and have demon¬ 
strated clearly that this can be achieved without any 
significant impact on ocean ecosystems or resources. 
The key now is to ensure that these impacts remain 
insignificant, as the industry scales. Expansion oppor¬ 
tunities, in terms of broader acceptance and greater 
access, will be conditional on future farm sites being 
located further offshore and in increasingly deeper 
waters. The industry therefore needs to develop the 
technologies to support these trends and to ensure 
that carrying capacities and prudent biosecurity prac¬ 
tices are adhered to as the industry grows. 

There are economic and environmental drivers for 
larger-scale open-ocean mariculture operations in 
deeper water, further offshore. As the automation sys¬ 
tems for net pen management become increasingly 
sophisticated [32], sea state becomes less of an imped¬ 
iment to more remote, more exposed sites. The major 
disincentive then becomes travel time to and from 
a site. Beyond about a 20 mile distance, it is increasingly 
difficult to maintain a farm site with day-workers, and 
some on-site residence becomes necessary. This, then, 
argues for further scaling: once a farm site is manned 
24 h, with staff changes twice-weekly or weekly and 
periodic delivery of feed and other supplies, there are 
strong commercial reasons for the operation to grow in 
scale to support the inherent fixed costs. 

Larger scale net pens present some challenges in the 
open ocean, particularly with fish handling, but there 
are dvantages as well. Larger pens are more cost- 
effective in terms of cost per unit of volume (i.e., cost 
per cubic meter) and in terms of managing the stock. 
A farmer manages a net pen, rather than a fish: a net 
pen of 3,000 m 3 and 50,000 fish requires roughly the 
same level of management as a net pen of 24,000 m 3 
and 400,000 fish. Similarly, where there is a significant 
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cost to the netting material, or cost to maintaining the 
netting, then the lower surface area to volume ratios of 
larger net pens are increasingly attractive. Larger net 
pens also require larger hatcheries, to produce suffi¬ 
cient fish in one cohort to stock the pen, and this results 
in greater hatchery efficiencies. 

New netting materials under development could 
revolutionize the net pen designs. Kikkonet™ appears 
to offer significant advantages over multi-strand nylon 
or cotton webbing, with greater rigidity and predator- 
breach resistance reducing the risk of escapes and easier 
cleaning capabilities with the single monofilament mate¬ 
rial. Similar advantages may be offered by new brass alloy 
materials under development, which almost completely 
eliminate biofouling and the attendant fish health con¬ 
cerns, as well as offering improved resistance to breaches 
from predators or mechanical tearing. Both Kikkonet™ 
and brass webbing, however, require rigid frames in 
open-ocean environments: constant wave and current 
movement can quickly destroy netting material if the net 
pen frame allows any excess movement. Each of these 
materials represents a significantly higher capital cost 
than nylon or cotton netting, but there are tremendous 
operational savings to be gained from reduced risk of 
failure and reduced labor for net changing, net washing, 
antifoulant dipping, and net loft work. 

Although much discussion has focused on the 
advantages of surface and submerged net pens and 
the relative operational efficiencies of each, this may 
be the single most important deciding factor for pen 
design: rigid-framed nets such as the submersible 
Sea Station® and Aquapod® can support more robust, 
rigid netting materials. Flexible framed nets such as 
PolarCirkel®, or those that rely on gravity to maintain 
net shape, such as Wavemaster® can only support pli¬ 
able nets of nylon and cotton in the higher wave and 
current conditions offshore. Where sea conditions or 
predator prevalence demand the more robust, rigid 
mesh forms of brass or Kikkonet, then submersible 
rigid-framed pens would appear to have operational 
advantages. 

Letting Go 

The potential for untethered “trans-ocean drifter net 
pen” fish farms has, up to now, been the stuff of science 


fiction [3]. However, the concept of untethered pens 
offers the potential for far larger scaling of operations, 
with almost negligible potential for any environmental 
impact; a net pen in very deep water is a nonpoint 
source for effluent impacts on water quality and sub¬ 
strate, and there is almost no potential for interaction 
with wild fish stocks (apart from aggregative effects). 
There are also advantages for fish health management 
on the farm - a drifter pen is essentially perpetually 
fallow. Drifter pens that ride regional ocean eddies or 
powered pens (either towed by a surface vessel, or 
where the fish containment is integrated into the vessel 
hull) are already moving from concept to prototype; 
within 5-10 years, some form of untethered net pen 
may become a commercial reality. Several examples are 
worth noting: 

In June, 2008, a trial in Puerto Rico tested a sub¬ 
merged, self-propelled, 3,200 m 3 . Aquapod as a 
demonstration of untethered aquaculture operations. 
The system was able to achieve 30 cm/s forward speed 
by using two electrically driven 2.5 m diameter pro¬ 
pellers powered by a diesel generator set onboard 
a small towed vessel. The Aquapod was able to 
hold position in 25 cm/s currents and was easily 
maneuverable. The concept of remote-controlled or 
autonomous operation was judged feasible using 
cage-mounted sensors and buoy-mounted GPS and 
communications. 

In October 2009, the State of Hawaii approved 
a tuna farm permit for a 247 acre site offshore of the 
Kohala Coast (north of the Kona Blue farm site) in 
water over 1,320 ft deep, for up to 12 untethered 
Oceanspheres. With no anchors, it is proposed that 
these fish farming platforms will be held in position 
by self-propulsive forces powered by a hybrid Solar 
Ocean Thermal Energy Generator. Hawaii Oceanic 
Technology, Inc. plans to deploy the first of these net 
pens in 2011. 

A regional drifter pen concept under development 
by Kona Blue - christened the Velella system - is 
projected to remain within a prescribed geographical 
area without anchoring, by riding the eddies in the lee 
of oceanic islands such as Hawaii, or by riding the gyres 
found in semi-enclosed seas, such as the Sea of Cortez 
or the Gulf of Mexico. The advantages of this concept 
are that it then enables the operation to consistently 
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stock, feed, tend, and harvest the fish in the pen, as well 
as changing crew and other maintenance tasks. By 
being untethered, the Velella can be located in very 
deep water, beyond normal mooring depths, and any 
potential environmental impacts are reduced to de 
minimis levels. 

As untethered net pens do not occupy a fixed geo¬ 
graphical space, they do not require a lease but rather 
are more properly regulated as non-powered vessels. 
Where there is no point source effluent from a motile 
pen, the monitoring and regulatory requirements may 
also be those applied to vessels, rather than fixed farm 
locations. Projects such as Velella might also pass 
through or be deliberately sited in international waters, 
where no permits are presently required and where no 
regulatory framework is currently in place. This regu¬ 
latory vacuum should be remedied, to both govern its 
growth but also encourage the development of this 
technology. 

IMTA 

Much interest is often focused on Integrated Multi- 
trophic Aquaculture (IMTA) as a means of improving 
the sustainability of open-ocean fish farming, by 
mitigating effluent impacts and more fully utilizing 
the nutritional inputs. In nearshore systems, the 
siting of filter-feeding bivalves around fish farm net 
pens in IMTA systems has been shown to reduce 
particulates in the water column. Similarly, siting 
macroalgal culture systems downcurrent of the fish 
farm can also reduce dissolved organics. These 
polyculture systems also offer secondary products 
(such as edible seaweeds and mussels) that are them¬ 
selves marketable. 

In open-ocean mariculture systems, however, the 
applicability and utility of IMTA is somewhat dubious. 
The priority for siting open-ocean mariculture opera¬ 
tions should be to strive for greater water clarity, depth, 
and current movement to optimize fish health and 
minimize any potential impacts on the effluent. These 
criteria are contrary to the desired characteristics of an 
IMTA operation, which presumes some entrainment of 
waters after it passes through the fish pen, with enrich¬ 
ment of the effluent providing benefits to the filter 
feeders and phototrophs. If open-ocean mariculture 


operations do indeed have no measureable impact on 
water quality, then presumably there is no benefit to be 
gained from co-siting the fish with filter feeders and 
plants. 

Furthermore, the capital and maintenance costs for 
the secondary production systems would be prohibi¬ 
tive. Open-ocean mariculture, in the foreseeable future, 
will need to focus on high-value species such as tuna, 
yellowtails, snappers, and groupers to provide a 
reasonable return. Only with the development of far 
greater operational efficiencies, in later years, will 
lower-value finfish species become economically viable. 
The commercial returns from deploying and 
maintaining even lower-valued macroalgae and bivalve 
culture lines around an offshore site would be even 
further deferred. 

In addition, the co-siting of mussels and seaweed, 
say, around an open-ocean site represents engineering 
and biological risks to the fish farm operation. The 
additional drag from the long-lines would add tremen¬ 
dous strain on the mooring system for the net pens. If 
moored separately, there would still be concerns that 
the secondary producer moorings could fail under load 
and entangle with the fish’s net pen or moorings. The 
resting stages of pathogens or parasites might also be 
harbored in the cultured species or ecosystem that 
surrounds the long-lines, resulting in a proliferative 
feedback loop and elevated levels of these pests in the 
cultured fish. 

The overarching goal of open-ocean mariculture is 
growing high-value species in near-pristine waters, 
with little potential for their interaction with surround¬ 
ing biologically active substrates. The notion of 
burdening an offshore operation with lower-value 
production systems is antithetical to this goal. The 
fish carrying capacity should allow the natural assimi¬ 
lative capacities of the surrounding ecosystem to fully 
absorb the farm’s ecological footprint. The nutritional 
by-products, that is, the nutrients in the effluent from 
open-ocean mariculture systems can still promote pro¬ 
ductivity on a broader scale, especially in oligotrophic 
tropical waters. However, a deeper ecological perspec¬ 
tive would suggest that there is no need for man to 
capture this productivity for commercial gain. The 
oceans can use these inputs in other ways; they are 
not necessarily “lost.” 
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Radioecology is a combination of radiobiology and 
ecology that involves the study of the behavior 
of radionuclides in the environment and their 
effects on organisms. Radioecology is multidisciplined 
involving studies of pathway analysis, speciation, 
bioaccumulation, food chain/food web transfer, radio¬ 
nuclide cycling, biological effects (radiobiology), 
modeling, prediction, environmental protection, risk 
assessment, and remediation, and may involve either 
ecosystem components or whole ecosystems. The goal 
of radioecology is to better understand, predict, and 
prevent environmental impacts due to radiation. This 
frequently involves the use of quantitative predictive 
models to assess the movement of radionuclides in the 
environment, and their transfer to biota where effects 
may be at the genetic, tissue/organ, individual, popu¬ 
lation, or community level. 

The articles comprising this section describe the 
sources of radiation in the environment, the behavior 
of radionuclides in environmental media, their transfer 
to biota and potential radiation effects on biota. The 
basis and content of each of the 15 articles in this 
section are described below. 

Radionuclides are ubiquitous in the environment 
and may be naturally occurring or the product of 
nuclear technology. Natural radiation consists of pri¬ 
mordial radionuclides present since the formation of 
the earth and cosmogenic radionuclides. Primordial 
radionuclides include the 238 U, 232 Th, and 235 U decay 
chain radionuclides and 40 K which forms a small per¬ 
centage of stable K (0.017%) present in all living organ¬ 
isms. Naturally occurring radioactive material is called 
NORM, whereas technologically enhanced naturally 
occurring radioactive material is called TENORM and 
includes mine/mill tailings, fertilizer, pulp and paper, 
and petroleum. Other sources of radiation are historic 
weapons testing, nuclear power plant operations, acci¬ 
dents (Chernobyl, Kyshtym), medical and industrial 


usage, and waste disposal ( ► Radiation in the Environ¬ 
ment, Sources of). 

In the nuclear fuel cycle, the mining and milling of 
uranium, reprocessing of spent fuel, and waste man¬ 
agement represent the largest releases of radionuclides 
to the environment. Uranium is a common contami¬ 
nant released from these activities. Uranium is more 
chemotoxic than radiotoxic and can be harmful at low 
concentrations especially to phytoplankton and 
zooplankton. Absorption of uranium in the digestive 
track of mammals is low and uranium does not 
biomagnify up the food chain. In vertebrates, damage 
is primarily to the kidney ( ► Uranium in the Environ¬ 
ment: Behavior and Toxicity). 

Tritium is a radioactive hydrogen atom that may 
be present as tritiated water (HTO), tritiated gas 
(HT), or organically bound tritium (OBT). Tritium 
exchanges readily with hydrogen atoms in media 
including water, so tritium moves rapidly through the 
environment by evaporation, transpiration, diffusion 
and isotope exchange, and enters biota and humans 
via inhalation and ingestion. Photosynthesis is the 
most important process converting tritium into OBT 
in complex molecules, increasing its retention in 
organisms. Dynamic models are available that account 
for isotopic discrimination and involve studies of 
atmospheric releases, model validation 
and uncertainty, and are process oriented ( ► Tritium 
in the Environment). Because tritium is a weak beta 
emitter, it is an internal hazard when inhaled, 
ingested, or absorbed into the body. Biological model¬ 
ing of tritium in the body involves the use of dose 
models for inhalation and ingestion of tritiated 
water based on specific activity concepts to access 
potential effects such as teratogenic, reproductive, and 
carcinogenic effects ( ► Tritium, Health Effects and 
Dosimetry). 

Within the terrestrial system the soil is the ultimate 
repository for radioactivity. Radionuclides are assimi¬ 
lated into soil organisms (microbes, invertebrates, and 
plant roots) or immobilized in abiotic components. 
Important properties of radionuclides are their biolog¬ 
ical half-life and ecological half-life. Chemical properties 
of radionuclides control their behavior and transport. 
Daughter products may have significantly different 
chemical properties from their parent. The soil 
hydrology is important in controlling the transport 


E 





3584 


E 


Environmental Radioactivity and Ecotoxicology of Radioactive Substances, Introduction to 


and the buildup of radionuclides. Particle surface area to 
volume ratio, cation exchange capacity, metal oxide and 
hydroxides, organic matter, cation exchange, precipita¬ 
tion/dissolution, surface complexation, redox reactions, 
plant uptake, transfer factors, solute partitioning, dis¬ 
persal coefficients, and retardation all interact to influ¬ 
ence the behavior and effect of radionuclides in the soil 
environment (► Radionuclide Fate and Transport in 
Terrestrial Environments). 

Soil erosion robs agricultural land of its productiv¬ 
ity, deposits sediment in downstream areas and 
impacts water quality. Distinctive behavior of 
fallout radionuclides ( 137 Cs, 210 Pb, 7 Be) allow the 
quantification of sediment mobilization and deposi¬ 
tion rates, tracing movement of fine sediment and 
associated contaminants through the landscape, dis¬ 
criminating the sources of sediment, and how this 
changes with change in land use/management practices 
( ► Fallout Radionuclides and the Study of Erosion 
and Sedimentation). 

In the arctic, lichens are the basis of the lichen/ 
caribou/human (wolf) food chain and are important 
in the transfer of radionuclides to top predators. 
Lichens have a large surface area and no roots but 
scavenge nutrients/radionuclides from the air. Caribou 
and reindeer (Rangifer tarandus) are the only mamma¬ 
lian species that can subsist on a diet of lichen over the 
winter. Caribou populations are presently on the 
decline possibly because contaminants such as radia¬ 
tion may interact with other environmental stressors 
and weaken the overall ability of the animals to survive. 
Long-lived-artificial radionuclides ( 137 Cs and 90 Sr) and 
natural radionuclides ( 238 U and 232 Th decay chain 
radionuclides) accumulate in caribou. Following the 
Chernobyl accident high levels of radiation were 
found in Swedish reindeer, and chromosomal aberra¬ 
tions were observed in Norwegian reindeer calves 
(► Radiation Effects on Caribou and Reindeer). 

To assess the contribution of foodstuffs to the inter¬ 
nal doses received by humans and wildlife, models rely 
on transfer parameters which describe the empirical 
ratios of activity between two compartments. Transfer 
parameters such as soil to plant concentration ratios 
are lumped parameters that account for the various 
processes and interactions that result in radionuclide 
concentrations in these media: speciation, oxidation, 
reduction, transformations and interactions with soil 


components, clay and organic matter content, stable 
element concentration, pH, land management, and soil 
solutions. Transfer coefficients to farm animals con¬ 
sider metabolic rate, milk production rate, soil 
intake, stable element intake, transfer of radioiodine 
to milk, partition coefficients, and allometric relations 
(► Radiation Assessment, Use of Transfer Parameters). 

Past practices and accidents have left radiation leg¬ 
acies. Remediation of these contaminated sites is 
receiving considerable attention. Many factors govern 
the effectiveness and cost of countermeasures, most of 
which are site specific. Environmental restoration is 
a complicated multidimensional task which needs to 
take into account radiological, ecological, and social- 
economic issues within a holistic approach. The ICRP 
principles of protection: justification, optimization, 
and dose risk limitation cost benefit analysis should 
be applied. Practitioners need to identify suitable, 
effective individual procedures, remediation technolo¬ 
gies, communication strategies, and decision and man¬ 
agement options. Spatial factors such as size of affected 
area, direct protection of the population, restriction of 
food, percent reduction in dose, and effectiveness must 
be considered ( ► Environmental Countermeasures 
and Restoration). 

The behavior of radionuclides in freshwater ecosys¬ 
tems depends on the chemical characteristics of the 
radionuclide, the nature of the waterbody, and the phys¬ 
ical state (particulate, dissolved, gaseous, colloidal) of the 
radionuclide. The chemical composition of the water 
(pH, dissolved organic content, alkalinity, hardness), 
partition coefficient, precipitation, uptake by phyto¬ 
plankton, sedimentation, transfer to sediment, and sed¬ 
iment focusing interact to govern the behavior of 
radionuclides. Transfer to sediment depends on 
whether the radionuclide behaves as a conservative 
(nonreactive - remains dissolved in the water column,) 
or nonconservative (reactive - has a strong affinity for 
suspended particles and sediment) element, sediment 
type, organic and clay content, water temperature, hydro- 
logical flushing rate, lake/river morphology, redox con¬ 
ditions, decomposition, food web transfer, feeding 
behavior, and trophic level of organisms (See section on 
“Aquatic Ecosystems” in ► Uranium in the Environ¬ 
ment: Behavior and Toxicity). 

The release of radionuclides to the marine environ¬ 
ment such as following the Fukushima accident in Japan 
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in April 2011 will result in interactions with individual 
organisms and food webs. The speciation of radionu¬ 
clides in seawater differs from that in freshwater and 
affects the mobility of radionuclides; their interaction 
in the water column; interactions with sediments, 
suspended sediments, and colloids; and transfer to 
local communities of animals and plants and ultimately 
their toxicity. Radionuclides in seawater may behave as 
nonconservative substances (e.g., Pu and Am) or con¬ 
servative substances (e.g., Cs, Sr, and Tc), which affects 
their loss to sediment and cycling in the food web 
(► Radioactivity in the Marine Environment). 

Marine databases of radionuclide concentrations in 
seawater have been compiled by regional and global 
projects to assess the movement of ocean currents. 
Movement of conservative ( 3 H, 14 C, 90 Sr, 129 I, 137 Cs, 
etc.) and nonconservative radionuclides ( 238, 239, 240 Pu, 
241 Am, etc.) have been used to study the circulation of 
ocean water in the Atlantic, Pacific, and Indian Oceans 
and interocean exchange of water. Models have been 
developed to describe currents such as the Great Ocean 
Conveyor Belt and Global Ocean Circulation Model. 
Radionuclide releases from Sellafield, UK and La 
Hague, France, have provided considerable informa¬ 
tion on the movement of water in the eastern North 
Atlantic and adjacent seas (► Radionuclides as Tracers 
of Ocean Currents). 

Nuclear accidents such as the Chernobyl accident 
released large pulses of radionuclides ( 90 Sr and 137 Cs) 
that allow researchers to follow a pulse through the 
environment improving our understanding of the 
behavior of radionuclides and the predictability of 
models. The complex processes of radionuclide migra¬ 
tion through a drainage basin are dependent on 
the characteristics of the specific radionuclide and the 
catchment. Movement of radionuclides is primarily 
governed by the hydrology and interactions with soil 
and, in aquatic systems, suspended and bottom sedi¬ 
ment. The complex hydrological processes that control 
runoff such as water flow, sediment erosion, and 
physical and chemical processes of contaminant inter¬ 
actions with soil govern transport and can be described 
by radionuclide transfer functions per unit of deposi¬ 
tion ( ► Radionuclide Migration from Catchments, 
Modeling). Models have been developed to predict 
inflow fluxes and outflow fluxes for rivers, lakes, and 
coastal regions. Dynamic models not only consider 


migration of contaminants in water bodies, removal 
to deeper waters, transfer to sediment, resuspension, 
and transfer through the food web, but incorporate 
biodilution and chemical dilution of radionuclides 
to fish and the biotic processes that control 
bioaccumulation and persistence in fish ( ► Nuclear 
Accidents, Chernobyl Fallout in Scandinavian 
Watersheds). 

Chemical speciation affects the mobility, bioavail¬ 
ability, and toxicity of metals and radionuclides. This is 
discussed for the terrestrial (► Radionuclide Fate and 
Transport in Terrestrial Environments), and aquatic 
environment ( ► Speciation of Metals, Effects on 
Aquatic Biota) and uranium ( ► Uranium in the Envi¬ 
ronment: Behavior and Toxicity). The marine specia¬ 
tion of radionuclides differs from that in freshwater 
(►Radioactivity in the Marine Environment). An 
understanding of metal/radionuclide speciation allows 
the prediction of the interaction of dissolved metals/ 
radionuclides with biological surfaces of organisms. 
The Biotic Ligand Model combines chemistry, physiol¬ 
ogy, and toxicology simultaneously to account for geo¬ 
chemical speciation and binding to specific sites to 
assess the bioaccumulation and toxicity of the chemical 
species. The Biotic Ligand Model is receiving consider¬ 
able attention in the study of metal toxicity to aquatic 
organisms, but is equally relevant to the accumulation 
and toxicity of radionuclides by biota ( ► Speciation of 
Metals, Effects on Aquatic Biota). 

The impact of radiation on organisms depends on 
the exposure rate, duration of exposure, time frame of 
biological repair and endpoint studied, which change 
with age. Exposures are either acute or chronic. Acute 
exposures in the environment are essentially restricted 
to the brief period following a major accident. The 
major consequence of chronic exposures at low dose 
rates as a result of routine releases is genetic effects. 
Chromosomal effects are also seen in organisms 
including humans living in areas of naturally elevated 
background levels of radiation. Radiation damage to 
DNA may include single- or double-strand breaks. 
Errors in DNA repair show up as chromosomal aber¬ 
rations and may eventually result in genomic instability 
through the generation of dicentric or acentric chro¬ 
mosomal fragments and may be expressed at the 
population level several generations later. High expo¬ 
sure rates affect the rates of morbidity, reproduction, 
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and mortality depending upon such factors as tissue 
sensitivity, dose fractionation, relative biological effec¬ 
tiveness of the radiation type, and adaptive response. 
Since most exposures are to low dose rates, potential 
effects are small and difficult to quantify and are tempo¬ 
rary in nature, the effect being reversible. The primitive 
germ cell is the most sensitive cell type. The biological 
effect is dependent on the number of cells struck by 
ionizing radiation, and energy type; alpha, beta, or 
gamma. The life history of the organism needs to be 
considered in the assessment of potential effects. Most 
vulnerable are long-lived, slow-reproducing species. 
Effects generally reported in exposed populations are 
life shortening, reproductive impairment, increased par¬ 
asite loads, decreased survival of young, increased muta¬ 
tion rates, deformities, and genetic effects. However, it is 
difficult to differentiate the effect of radiation from that 
of other contaminants. Protection can be at the individ¬ 
ual, population, or community level ( ► Ionizing Radia¬ 
tion on Nonhuman Biota, Effects of Low Levels of). 
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Ecological Risk Assessment 

Carroquino et al. ( ► Environmental Toxicology: Chil¬ 
dren at Risk) discuss the numerous factors that make 
children at greater risk from exposure to toxic sub¬ 
stances than adults. They point out that children have 
higher exposures relative to body weight than adults 
because they drink more water (seven times more per 
kilogram), eat more food (three to four times as much 
per kilogram for children between the ages of 1 and 5), 
and breathe more air (up to twice as much per kilo¬ 
gram for a child less than 1 year old). Lurthermore, 
during the time that their central nervous systems are 
developing, children are susceptible to permanent neu¬ 
rological damage from exposure to neurotoxins such as 
methyl mercury, lead, and ionizing radiation. And if 


endocrine disruptors send false signals to developing 
reproductive organs, there is a high probability that the 
resulting dysfunction will be permanent and irrevers¬ 
ible. Babies in utero are especially sensitive to the use of 
addictive substances such as drugs and alcohol by their 
mothers, and studies have shown that maternal 
smoking during pregnancy increases the risk of preg¬ 
nancy loss, stillbirth, and infant mortality. 

Willett and Loran ( ► Ecological and Health Risks at 
Low Doses) discuss what is known and, more fre¬ 
quently, what is not known about the mechanisms 
associated with human health effects caused by expo¬ 
sure to low doses of xenobiotics. Traditional sigmoidal 
dose-response models lead to the conclusion that 
below a certain threshold dose, there are no adverse 
health effects. Application of such models has led to the 
use of so-called no observed adverse effect levels 
(NOAEL), which in turn have become the basis for 
setting acceptable daily intakes of substances known 
to cause adverse effects at higher doses. 

In most cases, health effects at low doses are extrap¬ 
olated from experimental results at much higher doses 
using a linear dose-response model. But in a number of 
cases, low-dose treatments with a toxicant have been 
shown to induce a beneficial response, a phenomenon 
called hormesis. And pretreatment of animals with 
metals, specifically cadmium, copper, mercury and 
zinc, is protective for subsequent exposures. The mech¬ 
anism underlying the acclimation to metals is synthesis 
of thionein, a protein that normally only occurs in trace 
amounts in certain tissues (blood, gills, liver, kidney, 
and intestine) and can effectively sequester toxic metals 
if the dose is not too great. But in humans, at least, 
there is a growing consensus that any amount of lead in 
the body can be damaging, and especially so in chil¬ 
dren, in whom adverse effects of lead exposure are 
typically associated with brain damage. Exposure to 
neurotoxic organophosphorus pesticides such as chlor- 
pyrifos and to endocrine disruptors such as Bisphenol 
A are additional examples of risks that are poorly 
quantified because of inadequate understanding of 
the mechanisms underlying effects at low doses. 

Bain ( ► Ecological Risk Assessment and Animal 
Models) discusses the many issues associated with 
assessing the risk to plants and animals caused by 
damage/modification of the environment by human 
activities. As noted by Bain, this is a multistep process, 
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culminating ultimately in a phase characterized as risk 
management. The first few steps involve problem for¬ 
mulation and analysis, and these are the focus of the 
chapter. Much of the information upon which current 
environmental standards are based has come from 
either acute (short-term) or chronic (long-term) expo¬ 
sure of organisms to stress and observing the effects on 
survival, growth and development, and reproduction. 
Since the number of organisms potentially impacted by 
toxic substances and stresses is very large, care must be 
taken to carry out these bioassays with organisms that 
are in some sense representative of those found in the 
natural environment, including in particular some of 
the more sensitive species and life stages. Since such 
bioassays in most cases involve exposure to only 
one stress, an alternative, particularly when concern 
involves a combination of stresses and toxic substances, 
is to expose test organisms to, for example, contami¬ 
nated soil or water and observe the impacts on survival, 
growth, and/or reproduction. In this way, the interac¬ 
tion of a combination of stresses is taken into account. 

Alternatives to such bioassays include use of so- 
called bioaccumulation factors (BAFs) to estimate 
the effective degree of exposure based on the 
concentration(s) in the organism. In the last several 
decades, an approach to quantifying the overall health 
of a community of organisms has been the use of so- 
called biotic indices and rapid bioassessment protocols. 
Such surveys compare populations and community 
compositions of macroinvertebrates, fish, or periphy¬ 
ton (communities of organism and organic matter 
attached to surfaces) between a reference site and the 
site of interest. Finally, and most recently, molecular 
methods, so-called “omics” (genomics, proteomics, 
transcriptomics, and metabolomics), have begun to 
be used, with the expectation that their use will provide 
a more sensitive assay for stress than the proverbial 
canary in the coal mine. While this expectation is very 
likely true, the interpretation of results from omics 
studies with respect to issues such as risk management 
is an evolving art. 

Medema (► Microbial Risk Assessment of Patho¬ 
gens in Water) discusses the theoretical and practical 
issues associated with using quantitative microbial risk 
assessment (QMRA) to estimate the risk to public 
health from the presence of pathogens in drinking 
water derived from surface sources. This is an approach 


that has become popular in recent years because of the 
absence of relevant epidemiological data that might 
otherwise be used to estimate risks associated with 
drinking contaminated water. The use of QMRA 
has benefitted from the existence of a database that 
can be used to relate dose to the risk of infection by 
the protozoan pathogen Cryptosporidium parvum , 
a database, incidentally, derived from studies with 
human volunteers. An important point noted by 
Medema is that in many cases, the risk estimates are 
dominated by relatively infrequent so-called hazardous 
events, such as floods, when the water distribution 
system is compromised and/or the source water is 
seriously polluted. The water distribution system in 
New Orleans following Hurricane Katrina is a case in 
point. 

The strategy in QMRA is to use mathematical and 
statistical models to follow drinking water from its 
source to a treatment system, through a distribution 
system, and finally to the consumer and to estimate the 
probability that a person drinking water from his/her 
tap would be infected by a particular pathogen. The 
process is an iterative one, and Medema notes that one 
of the important products of the modeling exercise is 
identification of knowledge gaps. 

In addition to its role in estimating risks due to 
consumption of contaminated water, QMRA has been 
used to identify the risks associated with the use of 
reclaimed wastewater, for example, to irrigate golf 
courses and crops destined for human consumption, 
and for recreational water use. As in the case of drink¬ 
ing water, the strategy is to develop a probabilistic 
model of the public health risk associated with such 
practices. Persons and agencies responsible for risk 
management use the output of QMRA to make deci¬ 
sions about appropriate levels of drinking water treat¬ 
ment and to establish appropriate policies for water 
reuse and recycling. 

Monitoring 

Soelberg and Furlong ( ► Biosensors and Bioassays for 
Ecological Risk Monitoring and Assessment) describe 
a portable device for monitoring pollutants based on 
surface plasmon resonance (SPR) technology. This is 
a rather esoteric technology but simplistically takes 
advantage of the change in refractive index (RI) of 
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a solution adjacent to a gold surface when the solution 
contains a target compound. The gold surface is coated 
with so-called recognition elements that provide rec¬ 
ognition element (e.g., antibody) attachment sites for 
the target compound. The technology is designed to 
detect targets dissolved or suspended in a liquid 
medium, but the chapter includes a discussion of 
methodologies for collecting/transferring substances/ 
compounds from air or soil samples or from the surface 
of solid objects. 

Details of the detection process depend on the size 
of the target, which may range from small molecules to 
single-celled organisms such as viruses and bacteria. 
Binding of small molecules to the sensor surface usually 
does not produce a change in RI large enough to be of 
practical use, but the detection of such small molecules 
can be achieved using a competition/inhibition assay 
involving another larger analyte. For larger molecules 
such as proteins, binding of the target may effect 
a change in RI sufficient for detection, but the signal 
can be amplified by addition of a second recognition 
element that binds to another site on the target mole¬ 
cule. Whole cells, which are generally too large to be 
directly detected by SPR technology, may, nevertheless, 
be detected indirectly as a result of their binding to an 
appropriate antibody, whose concentration is assayed 
by SPR in the presence and absence of the cells of 
interest. Because of the small size of the sensor chips, 
it is quite possible to design a flow-through system that 
can detect multiple analytes in sequential fashion. 
Future directions may involve the further miniaturiza¬ 
tion of the technology via illumination of an array of 
recognition element spots in a way that would allow 
detection of hundreds or perhaps thousands of analytes 
simultaneously. 

Keum et al. ( ► Biomarkers and Metabolomics, Evi¬ 
dence of Stress) review the information and under¬ 
standing of the effects of stress on organisms that has 
been derived from a study of metabolomics. This is an 
emerging area of omics research that lies downstream 
of genomics and proteomics. The rationale for identi¬ 
fying metabolic fingerprints associated with particular 
kinds of stress is to provide an early warning mecha¬ 
nism that could be used to trigger corrective action 
before serious damage has been done. To date, much 
of the metabolic work has involved single-celled micro¬ 
organisms and plants and their responses to stresses 


such as nutrient deprivation, lack of water, pesticides, 
and/or salt stress. One of the more interesting 
metabolomic studies has been the investigation of the 
response of plants to attack by herbivorous insects and 
pathogens. Evolutionary arguments would lead one to 
expect that these responses provide some degree of 
defense for the plants, and elucidation of the relevant 
upstream portion(s) of the genome has enabled genetic 
engineers to produce crops resistant to attack by 
important plant pests. As noted by the authors, the 
metabolomes of mammals are more complex than 
those of plants and microorganisms, but ultimately, 
lessons learned from the study of relatively simple 
organisms will likely facilitate more informed investi¬ 
gations of the human metabolome, the etiology of 
various forms of cancer, and the metabolic pathways 
of preclinical drugs being obvious topics of interest. 

Boehm and Soller ( ► Recreational Water Risk: 
Pathogens and Fecal Indicators) discuss the problems 
associated with determining the risk of infection asso¬ 
ciated with recreational water use. Although there is 
a long list of pathogens that could potentially infect 
swimmers, more than 95% of all non-food-borne 
illnesses in the United States are caused by only 
eight: the viruses norovirus, rotavirus, and adenovi¬ 
rus; the bacteria Campylobacter , Salmonella , and path¬ 
ogenic E. coli; and the protozoans Cryptosporidium 
spp. and Giardia lamblia. Because the concentrations 
of such pathogens in recreational waters are typically 
low, monitoring of recreational waters has traditionally 
relied on the concentrations of so-called indicator 
organisms, which are found in relatively high concen¬ 
trations in human feces but are not themselves 
pathogenic. Problems associated with the use of fecal 
indicator bacteria include the variable ratios of indica¬ 
tors to pathogens and the fact that epidemiological 
studies have shown that there is a human health 
risk associated with swimming in waters where there 
is no apparent source of human fecal contamination. 
In such cases, the risks of infection are presumably 
associated with animal excreta, including birds, pigs, 
and cattle. 

A complement and possibly alternative to tradi¬ 
tional recreational water quality monitoring tech¬ 
niques is quantitative microbial risk assessment 
(QMRA), a health risk modeling approach that trans¬ 
lates microbial exposures into infection or illness risk 
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estimates. The input to such models includes informa¬ 
tion on the die-off rates of pathogens in the environ¬ 
ment, assumptions about the amount of water 
swallowed by recreational swimmers, and the infectiv- 
ity of the various pathogens. Comparisons of the out¬ 
put of QMRA models with the incidence of relevant 
infections have provided valuable insights concerning 
the likely contribution of land runoff and animal feces 
to the risks associated with recreational water use. 
Historically, the pathogen that has received the most 
attention in QMRA simulations is the rotavirus 
because of its high infectivity and the availability of 
dose-response information. More recent work has 
focused on the norovirus following publication of rel¬ 
evant dose-response information in 2008. 

Mechanisms 

Wilson ( ► Environmental Toxicology: Carcinogenesis) 
provides an excellent overview of the numerous and 
diverse mechanisms responsible for causing cancer. He 
points out that nine different conditions must be sat¬ 
isfied before a malignancy can develop and that to 
satisfy each of these conditions, one or more mutations 
are needed to alter the intracellular signaling pathways 
and/or response to the tissue microenvironment. The 
human body in fact has numerous defenses against 
cancer, including inter alia, DNA repair systems with 
phenomenally high fidelity, detoxifying enzymes for 
reactive species, glutathione to quench reactive electro¬ 
philes, and, in the case of mammalian and human cells, 
the ability to become terminally senescent or undergo 
self-suicide (apoptosis) upon self-recognition of excess 
DNA damage or perturbation of cell growth controls. 
However, these defenses are not infallible, and a variety 
of mutations generally classified as (1) single-base 
mutations, (2) chromosomal aberrations, (3) insertions 
and deletions (indels), and (4) epigenetic mutations 
can eventually lead to cancer, the probability being 
directly proportional to the dose and duration of car¬ 
cinogen exposure. 

With a few exceptions (e.g., mesothelioma and expo¬ 
sure to asbestos), most cancers have multiple etiological 
causes. Therefore, the development of a particular 
form of cancer does not, in most cases, implicate 
a mechanism. On the other hand, the mechanisms asso¬ 
ciated with some forms of stress are rather well 


established. Chronic alcohol consumption, for example, 
is associated with oral cavity, pharynx, larynx, esopha¬ 
geal, liver, and colorectal cancer, and breast cancer in 
women, and suspected to be involved in pancreatic and 
lung cancer. Alcohol (i.e., ethanol), however, is not itself 
genotoxic, but acetaldehyde, to which ethanol is 
converted in the liver, is mutagenic. Chronic alcohol 
use is associated with three carcinogenic pathways: 
DNA damage and mutation, inflammatory processes 
that promote proliferation, and vitamin deficiencies 
that perturb DNA epigenetic patterns. Exposure to ultra¬ 
violet radiation is associated with skin cancer, the most 
common cancer in the United States. Ultraviolet radia¬ 
tion in the wavelength range 290-320 nm causes DNA 
mutations leading to signature cyclobutane pyrimidine 
(6-4) pyrimidone photoproducts and pyrimidine 
dimers, which, if not repaired, result in signature 
C (cytosine) to T (thymine) and CC to TT mutations, 
respectively. Tobacco smoke, probably the most impor¬ 
tant anthropogenic carcinogen of the twentieth century, 
contains at least 300 known or suspected carcinogens 
and is believed to be an important etiological factor in 
many cases of oral cancers, esophageal, bladder, pancre¬ 
atic, kidney, and possibly breast and cervical cancers. 
Many of the carcinogens in tobacco smoke alkylate 
or arylate DNA, with consequent production of 
promutagenic lesions, or produce DNA adducts that 
can lead to primarily G (guanine) to T (thymine) 
transversions, which are the most common single-base 
mutations in lung tumors. Despite the remarkable num¬ 
ber of carcinogens in tobacco smoke, many years of 
smoking (~50 pack years in the case of lung cancer) 
are generally required before a tumor develops, 
a testimony, as noted by Wilson, “to the resilience of 
the human body to the bombardment of such a powerful 
combination of the carcinogens and toxic agents present 
in tobacco smoke.” 

Jones (►Environmental Toxicology: Oxidative 
Stress) points out that understanding of the mecha¬ 
nisms that underlie oxidative stress has changed signif¬ 
icantly in recent years. Thirty years ago, oxidative 
stress was understood to reflect a predominance of 
prooxidants (agents that initiate radical reactions, oxi¬ 
dize biologic components, or interfere with normal 
reductive and antioxidant functions) over antioxi¬ 
dants, leading to macromolecular damage. If oxidative 
stress were as simple as that, supplementation with 
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antioxidants would be expected to provide protection 
but in fact has been found to be associated with little or 
no health benefits in humans. Furthermore, the discov¬ 
ery of NADPH oxidases that generate 0 2 ~ and H 2 0 2 as 
signaling molecules suggested that oxidative stress 
could involve disruption of redox signaling and control 
mechanisms. Living cells have a variety of mechanisms 
that almost completely prevent the sort of radical chain 
reactions associated with the more traditional concept 
of oxidative stress. Although such chain reactions may 
be relevant in cases of acute exposure, non-radical 
oxidants such as H 2 0 2 , lipid hydroperoxides, quinones, 
disulfides, and reactive nitrogen species are now 
believed to be far more important from the standpoint 
of chronic toxicity. The major mechanisms of non¬ 
radical oxidative stress involve primarily effects on 
thiols and selenols in proteins and secondarily muta¬ 
genic damage to DNA. Non-radical oxidants can inter¬ 
fere with reversible oxidation-reduction reactions of 
thiols or selenols (e.g., the thiol in cysteine, the 
thioether in methionine, and the selenol in 
selenocysteine) by causing abnormal oxidation or irre¬ 
versible modification. These changes alter physiologi¬ 
cal function in receptor signaling, transcriptional 
regulation, cell proliferation, angiogenesis, and apo¬ 
ptosis. The hypothesized mechanisms presume that 
(1) all biological systems contain redox elements that 
function in cell signaling, (2) organization and coordi¬ 
nation of the redox activity of these elements occurs 
through redox circuits dependent upon common con¬ 
trol nodes, (3) the redox-sensitive elements are spatially 
and kinetically insulated, and (4) oxidative stress is 
a disruption of the function of these redox circuits 
caused by specific reaction with the redox-sensitive 
thiol/selenol elements, altered pathways of electron 
transfer, or interruption of the gating mechanisms 
controlling the flux through these pathways. Because 
biological systems are effectively engineered to conceal 
stress effects, subtoxic exposures can accumulate with¬ 
out overt signs of adverse effects. A redox proteomics/ 
gene expression approach may therefore provide 
a means for systems analysis to detect evidence of 
oxidative stress before overt symptoms become 
apparent. 

Laws ( ► Toxic Chemical Risks) note that most 
exposure to toxic chemicals is associated with lifestyle 
decisions and not, as might naively be expected, 


because a person lives downwind or downstream 
from a source of pollution. In the United States, by 
far, the biggest source of exposure is tobacco smoke, 
which, as noted by Wilson (vide supra), contains sev¬ 
eral hundred known or suspected human carcinogens. 
The list of toxic substances includes ammonia, arsenic, 
benzene, benzo[a] pyrene, cadmium, carbon monoxide, 
nicotine, and several compounds classified as tobacco- 
specific N-nitrosamines. The death toll from cigarette 
smoking accounts for about 20% of all deaths in the 
United States. Fortunately, the percentage of adults who 
smoke in the United States has been steadily declining 
from 42% in 1965 to 21% in 2008-2009. 

After tobacco smoke, the most significant source of 
exposure to a toxic chemical in the United States is 
alcohol (ethanol) consumption. Health problems asso¬ 
ciated with chronic alcohol consumption include liver 
damage, a variety of reproductive system disorders, and 
damage to the digestive tract. Alcohol abuse accounts 
for about 100,000 deaths per year in the United States. 
The specific causes of death include drunken driving, 
cirrhosis of the liver, cancer, and stroke. In addition to 
these direct effects on human health, alcohol abuse is 
associated with half of all homicides and 40% of 
assaults. Alcohol consumption can be addictive, and 
about 8% of Americans abuse alcohol or are alcoholic. 

Drug misuse/abuse follows alcohol in terms of 
the frequency with which people are exposed in the 
United States. The number of deaths associated with 
unintentional drug overdoses has been rising more or 
less exponentially during the last 30 years and is now 
approaching 30,000 per year. The biggest single source 
of exposure is abuse of prescription analgesics (pain¬ 
killers). The painkillers in question are opioids, which 
puts them in the same drug category with drugs such as 
heroin and cocaine. They effectively block feelings of 
pain by binding to opioid receptors in the brain, spinal 
cord, and gastrointestinal tract, but they also produce 
feelings of euphoria, which accounts for their abuse 
and compulsive use. When used as prescribed, opioid 
analgesics seldom result in clinical addiction. Lethal 
effects are associated with the fact that unprescribed 
opioid use can lead to a suppression of breathing that 
may ultimately lead to respiratory failure. 

Cocaine and heroin are both illegal opioid drugs. 
Cocaine has virtually no beneficial application, and 
there is no approved medication to treat cocaine 
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addiction. Heroin abuse is associated with a variety of 
health effects related to the tendency of addicts to inject 
the drug with a needle that in some cases is far from 
sterile. Thirty-six percent of AIDS cases and an esti¬ 
mated 70-80% of the new hepatitis C cases in the 
United States each year are attributable to injection 
drug use. Fortunately, there are a variety of effective 
treatments available for heroin addiction. 

Obesity has become a major public health problem 
in the United States and reflects an old adage about 
toxic substances: the dose makes the poison. Food is 
essential for living organisms, but consumption of too 
much food can lead to serious health problems, includ¬ 
ing, in the case of humans, morbidity from hyperten¬ 
sion, abnormal amounts of fat in the blood, type II 
diabetes, coronary heart disease, stroke, gallbladder 
disease, osteoarthritis, sleep apnea, respiratory prob¬ 
lems, endometrial, and social stigmatization and dis¬ 
crimination. At the present time, roughly 100 million 
adults in the United States are overweight or obese. The 
solution is to consume fewer calories, but for many 
people, this is a difficult goal to achieve on a sustainable 
basis. Behavior therapy may be necessary to effect the 
necessary changes in lifestyle. 

Of the problems with little or no lifestyle connec¬ 
tion, lead intoxication appears to be the most perva¬ 
sive. Several decades ago, the use of leaded gasoline was 
a significant source of exposure, but this practice was 
fortunately phased out in the United States in the 
1970s. Today, the major lead-related issue is the fact 
that many residences were painted with lead-based 
paint prior to 1978, when the use of such paint was 
banned by the Environmental Protection Agency. 
Unfortunately, many people live in residences that 
were painted prior to 1978, and at the present time, it 
is estimated that 250,000 children in the U.S. have 
blood lead levels exceeding 100 pg/L, a concentration 
that the Centers for Disease Control feel should not be 
exceeded. Effects of lead intoxication are variable and 
depend on the degree of exposure. In children, the 
biggest concern is effects on the brain. The apparently 
obvious solution is to remove the lead-based paint in 
old homes, but doing so in a safe way requires special 
training and appropriate equipment. 

Many of the other incidents of exposure to toxic 
chemicals involve accidents in the home. Among the 
most frequent involve ill-advised mixing of cleaning 


compounds, the most common examples being bleach 
mixed with either ammonia, vinegar, or toilet bowl 
cleaners. Chlorine gas, a highly toxic and corrosive 
gas, is often one of the products of such mixtures, and 
in some cases, the chemical reactions are very exother¬ 
mic and can easily result in explosions. 

Fate and Transport 

Macauda and Hiscox (►Airborne Toxic Chemicals) 
use three case studies to illustrate issues related to the 
fate and transport of air toxics (atmospheric pollut¬ 
ants). Perchloroethylene (PERC) is the most common 
chemical associated with dry cleaning. It has been 
linked to liver and kidney tumors in rats and is consid¬ 
ered to be a carcinogen by the EPA, which has ruled that 
the use of PERC in dry cleaning must be completely 
phased out by 2020. Persons living or working in close 
proximity to dry cleaning establishments are at greatest 
risk from exposure to PERC. In 2005, the EPA ruled 
that large dry cleaning establishments use state-of-the- 
art recovery systems to ensure that PERC is not emitted 
to the air, and that smaller dry cleaners use generally 
available control technologies to control emissions. 
Dry cleaners that operate in apartment buildings are 
particularly troublesome because residents are obvi¬ 
ously living in close proximity to a source of PERC. 
For such establishments, the EPA has mandated the 
replacement of so-called transfer machines that require 
PERC-soaked clothes to be moved from washing to 
drying machines. These must be replaced by machines 
that both wash and dry. 

Gasoline combustion creates air pollution prob¬ 
lems on a broader scale geographically. The most trou¬ 
blesome issues relate to incomplete combustion of 
gasoline, which can result in the release of numerous 
pollutants. The pollutant of greatest concern from 
a public health standpoint is benzene, which has been 
linked to leukemia. Several approaches have been taken 
to mitigating the problems associated with incomplete 
gasoline combustion. First, many states now require 
gasoline pumps to be equipped with vapor recovery 
nozzles in order to trap gases that evaporate and would 
otherwise escape during refueling. Second, gasoline has 
been reformulated, primarily by addition of ethanol, to 
burn more completely. This reformulation has signifi¬ 
cantly reduced carbon monoxide emissions, a cause of 
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smog, but at the same time has led, in the case of 
California, to a 54% reduction in benzene emissions. 
A longer-term solution may be electric cars, which 
could be recharged at night when electric power 
demand is relatively low. However, such a transforma¬ 
tive change would require an unprecedented degree of 
cooperation between the electric power and automo¬ 
bile industries. 

Mercury emissions are an even longer-range prob¬ 
lem, with emissions in Asia linked to mercury levels in 
North America. Although historically, industrial use of 
mercury has led to some very serious localized human 
health problems (e.g., Minamata Disease), on a global 
scale, coal combustion accounts for about 60% of 
atmospheric emissions. Inhalation of mercury vapor 
can lead to serious health effects because the mercury 
reaches the brain, where it can cause irreparable 
neurological damage. Ironically, the primary 
post-combustion technologies that reduce mercury 
emissions do not specifically target mercury but instead 
are designed to remove oxides of sulfur and nitrogen. 
In the USA, at least these stack gas-scrubbing technol¬ 
ogies reduce mercury emissions by about 33%. In 
recent years, the USA has considered controlling the 
remaining mercury emissions from power plants using 
a cap-and-trade approach, but this idea was struck 
down by a Court of Appeals ruling. It now appears 
that mercury emissions from power plants will be reg¬ 
ulated by the requirement for use of Maximum Achiev¬ 
able Control Technology. 

Oliver and Heathwaite (Invisible Threats: Transfer 
of Pathogens and Nutrients Through and Across Agri¬ 
cultural Soils) discuss the tendency of agricultural soils 
to serve as conduits for pathogens and nutrients 
derived from agricultural activities. The nutrients of 
primary concern are nitrogen and phosphorus. The 
pathogens include a somewhat longer list or organisms 
(E. coli 0157, Salmonella spp., Campylobacter jejuni, 
Listeria monocytogenes , Cryptosporidium parvum , and 
Giardia intestinalis) that are shed by livestock and are, 
coincidentally, pathogenic to humans. The ability of 
soils to sequester N is very much influenced by the 
fact that soil particles typically have a negative surface 
charge. Thus, the ammonium ion, NH 4 + , is effectively 
retained in soils, while nitrate, N0 3 _ , is not. Phos¬ 
phate, although negatively charged, is typically retained 
in soils because of its tendency to bind with ferric iron 


(Fe 3+ ), aluminum (Al 3+ ), or calcium (Ca 2+ ). The sur¬ 
vival and movement of pathogens is very much 
influenced by the water content of the soil and the 
existence of conduits (e.g., macropores) for transport 
within the soil matrix. Under appropriate conditions, 
pathogens may survive within soils for literally months 
before being flushed out by seepage from storm events. 
Strategies such as no-till agriculture that are intended 
to minimize the use of pesticides and reduce nutrient 
runoff may actually exacerbate pathogen export by 
maintaining the continuity of macropores. In some 
cases, relatively simple strategies can be used to mini¬ 
mize pathogen problems, a case in point being the 
application of manure in a broadcast slurry to effect 
a more rapid destruction of associated bacteria through 
UV radiation and desiccation. Because of the poten¬ 
tially long time that pathogens and particularly nutri¬ 
ents may be sequestered in soils before being released, 
a considerable lag in time may exist between the imple¬ 
mentation of management practices and discernible 
changes in the water quality of drainage systems. 
Awareness of this time lag is important to the informed 
assessment of management practices. 

Bienfang et al. ( ► Bioaccumulation/Biomagni¬ 
fications in Food Chains) discuss two somewhat differ¬ 
ent case studies that illustrate problems associated with 
the bioaccumulation and/or biomagnifications of toxic 
substances in food chains. Ciguatera fish poisoning 
(CFP) is the most common food-borne disease related 
to the consumption of marine finfish, with ~50,000 
reported cases each year, but the actual number may be 
much larger. The toxin responsible for causing CFP 
comes in several dozen congeners. The various forms 
of ciguatoxin (CTX) are believed to be metabolites of 
gambiertoxins produced by various species of the dino- 
flagellate genus Gambierdiscus. Because CTX is toxic at 
very low concentrations, and in part because there are 
so many congeners, analytical detection of CTX has 
been problematic. In fact, virtually all reported cases 
of CFP are based on symptomology rather than detec¬ 
tion of CTX. Short-term effects of CFP are a variety of 
gastrointestinal disturbances. The long-term and more 
troublesome effects are of a neurological nature. 

As a high-molecular-weight lipid, CTX is a prime 
candidate for biomagnifications, but because of the 
aforementioned analytical detection issues, direct 
evidence of biomagnification of CTX is lacking. 
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The indirect evidence is the fact that the greatest num¬ 
ber of reported cases of CFP and the most severe 
reported symptoms typically involve consumption of 
carnivorous as opposed to herbivorous fish. 

Mercury is a well-known neurotoxin that is found 
in the environment in several different forms, by far the 
most troublesome of which is methyl mercury. In con¬ 
trast to other forms of mercury that are rather effi¬ 
ciently eliminated from the human body, methyl 
mercury is, at least initially, retained with about 95% 
efficiency following consumption and is only slowly 
excreted at a rate of ^1% per day. The damaging effects 
of methyl mercury are associated with its ability to 
cross the blood-brain and placental barriers. As 
a result, exposure to methyl mercury can cause serious 
and permanent neurological damage. Fetuses appear to 
be particularly susceptible, as women evidencing no 
overt symptoms of mercury intoxication have been 
known to give birth to babies hopelessly brain damaged 
from the effects of methyl mercury. 

Though methyl mercury has been reported at 
high concentrations in some fish populations, the 
documented cases of methyl mercury intoxication 
have all involved anthropogenic discharges. Natural 
concentrations of methyl mercury in fish are, with 
very few exceptions, invariably accompanied by even 
higher concentrations of selenium, the only exceptions 
being pilot whales and mako sharks. Evidence that has 
accumulated during the last 50 years has shown that 
mercury and selenium effectively sequester one 
another. The absence of any apparent mercury-related 
adverse health effects associated with the consumption 
of virtually any species of marine fish reflects the fact 
that any mercury in the fish is rendered inert by the 
selenium in the same fish. 

Prevention 

Kennedy and Tierney (►Xenobiotic Protection/ 
Resistance Mechanisms in Organisms) point out that 
the toxic effects of xenobiotics can be mitigated either 
through acclimation or adaptation, the former involv¬ 
ing intragenerational tolerance and the latter involving 
intergenerational resistance. Acclimation may involve 
nothing more complicated than avoidance, in some 
cases instinctive and in other cases the result of experi¬ 
ential learning. Adaptation involves selection for 


individuals in a population that for some reason are 
less sensitive/more resistant to a xenobiotic than other 
members of the population. Generally speaking, adap¬ 
tation involves either (1) toxicokinetically derived 
mechanisms, which alter the way in which organisms 
absorb, biotransform, and excrete chemicals, and/or 
(2) toxicodynamically derived mechanisms, in which 
target sites are modified to reduce sensitivity. An impor¬ 
tant consideration in the case of toxicokinetic mecha¬ 
nisms is the fact that lipophilic chemicals such as PCBs 
and chlorinated hydrocarbons are not easily excreted 
because they partition back into cells and tissues from 
excretory media (e.g., urine, bile). Water-soluble 
chemicals are not reabsorbed because lipid membranes 
of cells lining excretory routes act as barriers to their 
reuptake. Thus, in the case of lipophilic toxins, 
toxicokinetic modes of adaptation involve transforma¬ 
tion of the toxin into a more polar, water-soluble com¬ 
pound. This is generally accomplished through a series 
of reactions characterized as Phase I and Phase II. In 
Phase I reactions, a nucleophilic functional group is 
introduced or exposed in the parent molecule, and in 
Phase II, the Phase I metabolite or parent molecule 
already containing a functional group is conjugated to 
an endogenous molecule to form a water-soluble prod¬ 
uct. Cytochrome P-450 mixed-function oxidases, for 
example, are a superfamily of heme enzymes found in 
all living species that mediate Phase I transformations. 
Phase II involves conjugation reactions that produce 
substrates that are secreted with great efficiency. 
Glutathione conjugation mediated by glutathione 
S-transferases is a good example of a Phase II reaction. 

Toxicodynamically derived resistance refers to alter¬ 
ations in xenobiotic-receptor interactions, which can 
be effected by alterations in target site, increased or 
decreased concentrations of target molecules, or cir¬ 
cumvention of target function. Structural changes in 
the target site can significantly reduce binding of the 
xenobiotic but, as noted by the authors, can lead to 
biological effects from reduced functionality of the 
target in its normal physiological roles. So-called 
knockdown resistance, i.e., insensitivity of Na + chan¬ 
nels to insecticide (e.g., DDT and pyrethroids) inhi¬ 
bition, may occur by one or another of several 
mechanisms, including a reduction in the number of 
Na + channels, changes in the fluidity of nerve mem¬ 
branes, or alterations in the binding characteristics 
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between Na + channels and insecticides. In the case of 
citrus-scale mites, an interesting adaptation to HCN, 
which interferes with cytochrome oxidase in the elec¬ 
tron transport chain, is the use of an HCN-insensitive 
flavoprotein as an alternate electron carrier in the 
place of cytochrome oxidase. Application of molecular 
tools such as toxicogenomics, bioinformatics, and 
metabolomics has the potential to greatly enhance 
understanding of the mechanisms by which organisms, 
including humans, acclimate and adapt to xenobiotics. 

Consumption of fish and shellfish is associated with 
a number of well-documented beneficial health effects, 
but Meujo and Hamann (► Science, Policy, and Risk 
Management: Case of Seafood Safety) point out that 
there are also health risks associated with fish and 
shellfish consumption. The risks reflect the fact that 
fish and shellfish may contain dangerous concentra¬ 
tions of pathogenic bacteria and viruses, parasites, 
toxic metals, pesticides, drugs, and a variety of toxins. 
In the United States, the Food and Drug Administra¬ 
tion (FDA) is primarily responsible for ensuring that 
seafood is safe to eat, and the strategies taken by 
the FDA to fulfill this mission reflect the diverse sources 
of the contaminants of concern, practical issues associ¬ 
ated with analytical detection and surveillance, and 
the extent to which postharvest processing can be 
used to reduce contaminant concentrations to an 
acceptable level. 

Bacterial and viral pathogens can be very effec¬ 
tively eliminated by a process called high hydrostatic 
pressure (HHP) treatment. However, neither fish nor 
shellfish survive HHP, and that fact is a deal-killer for 
oysters destined for the raw oyster market. Two prom¬ 
ising alternatives to HHP are X-ray treatment and 
supercritical C0 2 exposure. Both methods appear 
capable of killing the relevant pathogens (e.g., Vibrio 
parahaemolyticus ) without killing the oysters. 

Neurotoxins produced by a variety of microalgae can 
accumulate in fish and shellfish to levels that cause 
serious health effects on consumers. The most trouble¬ 
some of these natural neurotoxins is ciguatoxin, but 
there is a long list of others, including brevitoxin, 
okadaic acid, and domoic acid. An additional neuro - 
toxin, scombrotoxin, is not produced by microalgae but 
instead is a product of a catalytic reaction involving the 
conversion of histidine into histamine. One of the prob¬ 
lematic issues with these neurotoxins is the fact that they 


are heat stabile, so cooking has little or no effect on the 
toxicity of seafood containing these compounds. From 
the standpoint of prevention, what is badly needed in the 
case of these neuro toxins is a simple and inexpensive test 
to determine whether seafood is safe to eat. In the case of 
brevitoxin, an antibody-antigen assay using ELISA 
(enzyme-linked immunosorbent assay) has recently 
been reported. The development of similar high- 
specificity probes for the other troublesome neuro toxins 
would clearly be desirable. 

Correctives 

Portier ( ► Bioremediation and Mitigation) describes 
several case studies in which heterotrophic microor¬ 
ganisms have been used to clean up organic waste in 
soil and water. The metabolic pathways involved in the 
biological degradation of various classes of organic 
compounds are reasonably well understood, and the 
ease/speed with which microbes can decompose differ¬ 
ent kinds of organic wastes have been the subject of 
numerous investigations. Normal alkanes, for example, 
are oxidized to primary alcohols, which are then fur¬ 
ther oxidized to carboxylic acids. The carboxylic acids 
are then converted via beta-oxidation enzymes to acetic 
acid and a simpler alcohol. Polycyclic aromatic hydro¬ 
carbons (PAHs) are generally considered to be the most 
toxic components of petroleum. Simple PAHs are ini¬ 
tially converted via dioxygenase enzymes to catechols 
and substituted catechols. The strategy in bioremedia¬ 
tion is to identify a community of microorganisms 
with metabolic pathways that enable them to effect 
such transformations and, insofar as possible, to create 
environmental conditions that encourage their catab¬ 
olism of toxic substrates. 

The examples cited by Portier illustrate some of the 
diverse strategies that may be employed to effect bio- 
remediation. The first example concerns the applica¬ 
tion of adapted microbial consortia to remediate 
diesel-contaminated soils at an active rail yard. Total 
petroleum hydrocarbon concentrations in the soil were 
reduced by almost a factor of 20 over a 6-month 
timeframe. In the second example, an immobilized 
packed bed reactor filled with support media was 
used to provide a habitat for adapted microbes with 
enzyme systems capable of degrading wood preserva¬ 
tive contaminants in groundwater. The contaminated 
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groundwater was pumped through the reactors and 
then used to irrigate surface vegetation and/or perco¬ 
late back into the ground. The site in question is now 
zoned recreational, and groundwater is close to meet¬ 
ing federal maximum contaminant level water quality 
criteria. The third example involves in situ remediation 
of groundwater via use of soil seeding bioreactors 
called bioplugs or bioconduits. Through manipulation 
of subsurface hydrology, the contaminated groundwa¬ 
ter is routed through the bioreactors. Surplus microbial 
biomass is allowed to escape into the surrounding soil, 
where the microbes infiltrate contaminated areas and 
further reduce the concentrations of toxic chemical 
constituents. Although bioremediation is not an 
option for all sorts of pollutant (e.g., heavy metals), 
the strategy has proven to be remarkably successful in 
cleaning up biodegradable wastes in terms of both cost 
and sustainability of treatment effects. 

Jewett and Wascom ( ► CERCLA, Sustainability and 
Public and Environmental Health) discuss another 
kind of corrective strategy: legal action. The particular 
legal subject is the Comprehensive Environmental 
Response, Compensation, and Liability Act (CERCLA), 
one of the last pieces of legislation signed into effect 
during the administration of President Jimmy Carter. 
CERCLA was intended to provide a mechanism to clean 
up “facilities” contaminated by inappropriate disposal 
of hazardous substances, with the word facility very 
broadly defined. Hazardous substances are also broadly 
defined, with one notable exception: petroleum and 
petroleum products, natural gas, and synthetic gas 
used for fuel. This is the so-called Petroleum Exception. 

Cleanup activities carried out under the 
“Superfund” component of CERCLA were originally 
financed by taxes imposed on the chemical and petro¬ 
leum industries and an environmental tax on corpora¬ 
tions. More recently, Superfund has been supported by 
allocations from the federal general fund and through 
additional legislation, such as the American Recovery 
and Reinvestment Act. 

Thirty years after its passage, the report card on 
CERCLA is mixed. There have been some very note¬ 
worthy success stories. Of the more than 1,600 sites at 
one time or another on the National Priority List 
(NPL), 340 have been delisted, i.e., cleaned up. The 
case study discussed by Jewett and Wascom, the Agri¬ 
cultural Street Landfill in New Orleans, is a provocative 


example given the apparently sweeping powers the EPA 
has to investigate and correct problems at facilities 
contaminated by inappropriate disposal of hazardous 
substances. One certainly has to ask why a municipality 
would elect to locate residential housing and an ele¬ 
mentary school on top of a landfill known, during its 
operative days, as Dante’s Inferno. A hazard ranking 
score (HRS) of 28.5 is sufficient to place a facility on the 
NPL. The HRS for the Agricultural Street Landfill was 
50.0. Why were the results of tests conducted at the site 
in the 1970s never made public? And why, when 
the presence of lead, mercury, and arsenic was detected 
in the soil, was a clay barrier not constructed (as 
recommended) to contain the contamination prior to 
the construction of an elementary school? 

These are troublesome questions, but there is 
another side to the coin. The Small Business Liability 
Relief and Brownfields Revitalization Act has provided 
a defense for parties who have generated, transported, 
or arranged for transport of only small amounts of 
hazardous material and for so-called innocent land- 
owners in the case of facilities acquired through inher¬ 
itance or bequest. Such revisions have reshaped 
CERCLA to be more effective and fair, but, as noted 
by Jewett and Wascom, by no means supplant 
CERCLA. In summary, CERCLA is a piece of legislation 
with a very ambitious goal, and as time goes by may 
well be modified to make it more effective and fair. The 
need for such legislation is clearly apparent now and is 
unlikely to go away in the future. 
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Glossary 

Complex dose The biologically effective (e.g., carcino¬ 
genic) dose of an agent that takes into account 
additive, subtractive, and synergistic interactions 
in a complex mixture 

DNA adduct Any covalent addition or modification of 
a nucleotide or the phosphate backbone in DNA 
Driver mutation Oncogenic mutations that are pri¬ 
mary to the further advancement and development 
of a cancer 

Genotoxic Causes DNA damage and/or mutations 
Immortal stranding The retention of a specific strand 
of the DNA double helix by a stem cell regardless of 
how many times the cell divides 
Initiation The formation of the first oncogenic muta¬ 
tion in a cell 

Oncogenic process Any process that is involved in the 
development of a tumor 

Proto-oncogene A normal gene that upon mutation 
produces an active oncogenic component in the 
development of cancer 

Promotion The effect of encouraging the develop¬ 
ment of cancer by increasing cellular proliferation 
and turnover 

Tumor suppressor gene A normal gene whose expres¬ 
sion product appears to block the development of 
cancer 

Definition of the Subject 

Carcinogenesis is the induction of cancer by exposure 
to exogenous agents, chemical or physical carcinogens. 
Cancer is a large family of life-threatening environmen¬ 
tal diseases. This definition is supported by the plethora 
of published evidence that cancer results from the 
accumulation of DNA mutations, which in turn 
depends upon progressive cell divisions. This process 
takes time. The accumulation of a sufficient allotment 


of oncogenic genetic errors for cancer to develop occurs 
over a large number of cell divisions (10s to 1,000s), 
which may take 10-50 years or more. Even without the 
intentional introduction of carcinogens into the 
human body by occupation or habits such as tobacco 
smoking, alcoholic use, and other life-style choices, our 
environment provides numerous opportunities and 
sources of DNA-damaging and potentially mutagenic 
lesions in our cells. Given enough time, the risk of 
cancer is inherent with life, while exposure(s) to carci¬ 
nogenic agents not only shortens the time required to 
achieve tumor development, but also increases the 
probability of cancer occurrence. 

In the present document, the term environmental 
carcinogenesis encompasses the induction of cancer by 
exposure to carcinogenic agents whether natural or 
man-made, present in our environment. The term 
environmental carcinogens will refer to all compounds 
foreign to the human or mammalian body, that is, 
xenobiotic and physical carcinogens. However, the 
development of cancer is a long and involved process 
that requires numerous cellular perturbations generally 
produced by both exogenous and endogenous reactive 
agents. As will be clarified in this chapter, for many 
cancers it is exceedingly difficult to separate out the 
contributions of the various etiological factors, includ¬ 
ing natural environmental carcinogens from man¬ 
made chemicals (see Table 1). For instance, a number 
of carcinogenic xenobiotics have been identified 
that are inherent in our food supply, for which the 
best-known examples are aflatoxins. Aflatoxins are pre¬ 
sent in fresh corn, peanuts (and peanut butter), and 
other grains, while other carcinogens such as PhIP 
(2-Amino-1 -methyl-6-phenylimidazo [4,5- b] pyridine), 
are produced from cooking meat [1, 2]. The potent 
nephrotoxic aristolochic acid component of certain 
herbs sometimes used in herbal remedies produces 
urinary tract cancers [3-5]. To add to this disparaging 
viewpoint, the lack of proper nutrition, such as folate 
deficiency, also leads to genomic instabilities that are 
involved in carcinogenesis [6-8]. 

The contribution of a few environmental carcino¬ 
gens is more clearly evident in selective cancer cases. 
Skin cancer induced by the ultraviolet (UV) radiation 
from sunlight is one of the more prominent environ¬ 
mental cancers in our world [9, 10]. Mesothelioma 
may result from inhalation of asbestos fibers [11]. 
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Environmental Toxicology: Carcinogenesis. Table 1 The causes of cancer 


Agent//Source 

Examples 

Associated cancer(s) 

Chemical 

Aflatoxins 

Hepatocellular carcinoma 

carcinogens 3 

Alcohol 

Hepatic, breast, stomach, colorectal, oral, ovarian, pancreatic 


Aristolochic Acid 

Urinary tract transitional cell carcinoma (renal, ureter, 
bladder) 


Asbestos 

Mesothelioma and lung cancers 


Benzo(a)pyrene 

Lung cancers (tobacco smoke associated) 


Benzene 

AML and non-Hodgkin's lymphoma 


Formaldehyde 

Nasopharyngeal 


Metals 


Arsenic 

Beryllium 

Cadmium 

Chromium (VI) 

Nickel compounds 

Skin, liver, lung, kidney, bladder 

Lung cancer 

Lung 

Lung, stomach 

Lung, nasopharyngeal 


Asbestos 

Mesothelioma and lung cancers 


2-Naphthylamine 

Bladder 


Tobacco Smoke 

Lung cancers (also nasopharyngeal, esophageal, bladder, 
pancreatic, kidney) 


Tobacco Dip/Chew 

Oral cancers 


Vinyl chloride 

Hepatic angiosarcoma 

Physical 

Ionizing radiation (a, [3, y, X-rays, etc.) 

multiple types of tumors 

carcinogens 3 

Sunlight (Solar Radiation) 

Skin (BCC, SCC, and melanoma) 

Biological 

carcinogens 

Hepatitis B, chronic infection 

Hepatocellular carcinoma 

Viruses 

Hepatitis C, chronic infection 

Hepatocellular carcinoma 


Epstein-barr virus 

Burkits lymphoma, nasopharyngeal carcinoma 


HPV (high risk types) 

Cervical cancer 

Parasites 

(chronic 

inflammation) 

Schistosoma hematobium 

Bladder cancer 

Chronic 

inflammation 

Inflammatory bowel disease 

Crohn's disease 

Ulcerative colitis 

Colon or intestinal cancers 


Chronic pancreatitis 

Pancreatic cancer 


Barrett's esophagus 

Esophageal carcinoma 


Helicobacter pylori, chronic gastritis 

Stomach cancer 

Fidelity limits of 
enzymatic 

DNA Replication errors (1 to 3 errors 
per genome replication) 13 

Possibly all or most cancers 

processes 

Imperfect fidelity of DNA repair 
systems 

Possibly all or most cancers 
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Environmental Toxicology: Carcinogenesis. Table 1 (Continued) 


Agent//Source 

Examples 

Associated cancer(s) 

Genetics (inherited predisposition) 

Cancer family 
syndromes 

Breast cancer family (BRCA1 or BRCA2 
gene mutation) 

Breast cancer 


FAP (Familial adenomatous 
polyposis) 

(APC gene mutation) 

Colorectal carcinoma 


Hereditary melanoma syndromes 
Dysplastic nevus syndrome 

Familial atypical mole-malignant 
melanoma syndrome 
Melanoma-astrocytoma syndrome 

Melanoma 


HNPCC (Hereditary nonpolyposis 
colorectal carcinoma) 

(MSH2 or MLH1 gene mutation) 

Colorectal carcinoma 


Li-Fraumeni cancer family syndrome 
(TP53 gene mutation in 70% cases) 

Osteosarcoma, adrenocortical carcinoma, brain, breast, 
colon, leukemia, lung, pancreatic cancers, various sarcomas 


MEN2A and 2B (Multiple endocrine 
neoplasia, types 2A and 2B) 

(RET gene mutation) 

Pheochromocytoma, thyroid carcinoma 


Neurofibromatosis types 1 and 2 (NF1 
and NF2 gene mutations) 

Neurofibromas, schwannomas, meningiomas, 
ependymomas, acoustic neuromas 


Retinoblastoma (RB gene mutation) 

Retinoblastoma 


Von Hippel-Lindau Syndrome (VHL 
gene mutation) 

Renal cell carcinoma, pheochromocytoma, 
hemangioblastomas 

Other genetic 

Ataxia telangiectasia (AT) 

Leukemias, lymphomas 

diseases 

Bloom's syndrome 

Leukemias, lymphomas, carcinomas 


Xeroderma pigmentosum (XP) 

Skin (BCC, SCC, and melanoma) 


a AII chemical and physical carcinogens listed are classified as Group I: Carcinogenic to Humans, by the International Agency for Cancer 
Research (IARC), update January 16, 2009; http://monographs.iarc.fr/ENG/Classification/index.php 

b DNA synthesis and the replication machinery has an astonishing fidelity of one error per 10 9 base pairs, which results in one to three 
errors every time the human genome is replicated [31, 32]. 


Exposures due to personal life-style choices and habits, 
such as tobacco smoking and alcohol use have been well 
studied and also provide clear examples of environ¬ 
mental carcinogenesis [12-16]. The carcinogenic 
impact of occupational exposures is another area that 
offers substantial clarification of mechanisms and risks 
involved in cancer development. Similarly, medical 
exposures, especially those involving chemotherapeutic 
agents and radiation therapy provide further risks of 
carcinogenic impacts, although these agents comprise 
risks of a spectrum of cancer types. 


Since the same principles and mechanisms involved 
in the development of human cancer apply to various 
animal species, and the primary focus of environmen¬ 
tal carcinogenic risk calculations is designed to protect 
humans, the human will be the primary focus of this 
chapter on environmental carcinogenesis. 

Introduction 

The biophysical aspects of this planet support and 
sustain life, while also being conducive to the 




















Environmental Toxicology: Carcinogenesis 


E 


3599 


development of cancer in most multicellular species. 
Fortunately, the risk of cancer is low without the assis¬ 
tance of acute exposures to carcinogenic agents. How¬ 
ever, the risk and incidence of cancer increases with age 
in the mammalian organism [17, 18]. The adage that 
“cancer is a disease of old age” has its foundings in 
empirical observation, while also demonstrating the 
requirement of extensive cell divisions (10s to 1,000s) 
for the development of tumors [19, 20]. The mixtures 
of gases that make up Earth’s atmosphere sustain the 
carbon-based metazoan life forms on this planet due to 
the cellular oxidative metabolic processes. Low carbon 
dioxide (less than 0.04%) and high oxygen (20.9%) is 
crucial to the life-sustaining respiration of mammals. 
Unfortunately, the same life-sustaining oxidative 
metabolism functioning in the human cell also pro¬ 
duces reactive oxygen species (ROS) that damage our 
cellular components including membranes, lipids, 
microfilaments, enzymes, and DNA. Additional ROS 
are produced by exposures to exogenous toxicants, 
including ionizing radiation, UV light, numerous 
toxic chemicals and heavy metals. The level of 
oxidative damage to mammalian DNA has been 
reported to be as high as 10,000 insults per cell per 
day [21, 22]. DNA replication (cell division) in the 
presence of oxidative damage leads to fixed mutations 
(permanent mutations). Endogenous DNA damage 
occurs in the form of modified nucleotides (e.g., 
8-hydroxy-2'-deoxyguanosine, thymidine glycol, 5- 
hydroxymethyluracil, etc.), depurination, depyrimi- 
dination, deamination, and single and double strand 
breaks on a daily basis from oxidative processes 
[23-27]. Tallying up all these insults suggests that 
human DNA receives greater than 10,000 insults per 
cell per day. Other mammals receive similar insults at 
the level proportional to their basal metabolic rates, 
that is, mice and rats receive approximately tenfold 
more DNA damage per cell per day as humans. 
The DNA repair systems of the human cell have 
a phenomenally high fidelity, but even an error rate of 
only 0.0001% (the equivalent of one in a million) may 
lead to the accumulation of genetic errors on the order 
of 1 every 100 days for each and every cell [28, 29]. To 
add further insult to injury, the fidelities of DNA poly¬ 
merases are also less than perfect, although remarkably 
precise in the human, such that errors are incorporated 
during DNA synthesis [30] . The reported occurrence of 


one to three mutations every time the human genome 
is duplicated further supports the limitations of cellular 
functions and endurances [31, 32]. Thus the processes 
involved in sustaining life also lead to DNA mutations. 
These endogenous life processes coupled with the envi¬ 
ronment that is so crucial in supporting and nurturing 
life leads to the accumulation of DNA mutations 
and, over many years, a significantly increased risk 
of cancer. 

Nature maintains a balance between productive life 
and cancer risk. As with any genetic disease that termi¬ 
nates life before reaching a reproductive age or during 
the fertile years, the early occurrence of terminal cancer 
would be self-limiting. Barriers to cancer development 
are necessary for the survival and proliferation of the 
species. As advanced life forms evolved, so did 
a number of protective mechanisms that are quite 
effective in derailing tumor development. In addition 
to detoxifying enzymes for reactive species, glutathione 
to quench reactive electrophiles, and numerous DNA 
repair mechanisms, mammalian and human cells 
become terminally senescent or undergo self-suicide 
(apoptosis) upon self-recognition of excess DNA dam¬ 
age or perturbation of cell growth controls [33]. The 
processes involved in cellular aging, that is, loss of 
telomere length of chromosomes, shifts in gene expres¬ 
sion, and responsiveness to external signals that finally 
lead to replicative senescence after a finite number of 
cell divisions, may actually be a means to block the 
development of cancer. Terminal differentiation pro¬ 
grams, such as the short 4-5 day lifespan of the colonic 
mucosal cells, and the approximately 14 day lifespan of 
the squamous differentiating skin keratinocytes 
ensures that these cells will not be around if they pick 
up too many oncogenic DNA mutations during the 
rapid cell divisions that take place immediately prior 
to the final terminal differentiation processes [34]. 
Should these intracellular anticancer mechanisms fail, 
the local tissue microenvironment plays a role in 
containing or dampening the expansion of abnormal 
cells, and the body’s immune system may take over and 
remove aberrant cells, thereby removing the threat of 
cancer [35-37]. Considering the low frequency of 
occurrence of cancer until old age, these endogenous 
protective anticancer mechanisms are advantageous for 
the human race, albeit not necessarily for the less for¬ 
tunate individual statistical outlier. 
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Accumulation of Mutations 

Avenues are available for the determined cell to find ways 
to circumvent these protective barriers against cancer 
development. With each protective mechanism, one or 
more DNA mutations enable cells to bypass this blockade 
and progress toward a neoplastic state. These “driver” 
mutations are the primary oncogenic errors involved in 
tumor development. Due to the number and variety of 
checks and balances opposing oncogenic development, 
a cell must accumulate a sufficient number of “driver” 
mutations to circumvent all the blockades for a tumor to 
develop, without meeting its demise by one of these 
protective mechanisms. This process is a statistical prob¬ 
ability that weighs heavily on the healthy life side until old 
age, unless perturbed by carcinogenic exposures whereby 
the probability of tumor development increases (see 
Table 1). The rate of tumor development is increased 
sometimes many orders of magnitude above the spon¬ 
taneous background rate by exposure to exogenous 
carcinogens. The timing of the onset of cancer is due 
to the dose and duration of carcinogen exposure and 
not to either chronological age or the age that exposure 
began [38, 39]. 

Properties of Cancer Cells 

The product of carcinogenesis is a thriving cancer cell. 
The successful cancer cell has collected driver muta¬ 
tions that enable it to: (1) ignore antigrowth signals in 
the tissue microenvironment; (2) be at least partially 
independent of the requirements for exogenous mito¬ 
genic factors for cellular growth and proliferation; 
(3) block or inhibit differentiation programs; (4) evade 
apoptotic and senescence programs; (5) establish an 
extended replicative potential or immortality; (6) secrete 
factors that enhance and sustain angiogenesis; 
(7) acquire the ability to evade the immune system; 
and (8) acquire genomic instability [33, 35-37]. 
These are the minimal requirements for the develop¬ 
ment of a benign tumor. A malignancy has acquired 
one or more additional mutations that confer a ninth 
property, that of invasive and metastatic capabilities. 
For each of these nine properties of cancer, one or more 
mutations are needed to alter the intracellular signaling 
pathways and/or response to the tissue microenviron¬ 
ment. In some cases, only one driver mutation may be 
required to confer a specific property, but many of 


these nine properties are conferred only after multiple 
oncogenic mutations have been established in the 
developing cancer cell genome. 

Oncogenic Mutations 

The establishment of each driver mutation requires the 
successful negotiation of several protective molecular 
mechanisms. First of all, the formation of a DNA 
adduct, single or double strand break, or other form 
of DNA damage must occur due to exposure to 
a physical carcinogen (i.e., UV light or ionizing radia¬ 
tion), a chemical carcinogen, or the presence of an 
endogenous reactive species that has successfully nego¬ 
tiated the gamut of detoxifying enzymes and glutathi¬ 
one. This DNA damage must also have occurred in an 
oncogenically important base, gene, and/or sequence. 
Next this DNA damage must be converted into 
a permanent genetic alteration, a fixed mutation. 
The formation of a fixed mutation requires the com¬ 
bined failure of the cell’s genome surveillance system to 
recognize and/or repair the DNA lesion, incorporation 
of an incorrect nucleotide (miss-pairing), and the sur¬ 
vival of this mistake up to and through a second round 
of DNA synthesis, or through faulty repair [40-42] 
(Fig. 1). The frequency by which this occurs is exceed¬ 
ingly rare, as noted above by the high fidelity of the 
DNA repair system alone. The bombardment of cells in 
the human body by acute and chronic exposures to 
exogenous carcinogens increases the frequency of 
DNA damaging events and cellular turnover, both of 
which in turn increase the occurrence of fixed muta¬ 
tions and tips the balance of these processes toward 
an increased risk of cancer development (Fig. 1). 
The higher the dose received and the longer the expo¬ 
sure, the greater the risk of cancer. 

There are several mechanisms by which carcinogens 
can influence the formation of DNA mutations, and 
the type of mutation(s) formed is dependent upon the 
mode of action of the carcinogen (see Table 2). Exog¬ 
enous and endogenous agents can produce DNA muta¬ 
tions by directly damaging DNA, perturbing DNA 
replication machinery, polymerases, etc., inhibiting 
DNA repair systems, critically shortening telomere 
lengths, altering the cytoskeletal proteins, mitotic spin¬ 
dle fibers, enzymes involved in mitotic processes, and 
by stimulating cellular turnover and proliferation. 
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Exogenous Agents 


Chemical Carcinogens Oncogenic Viruses 
Physical Carcinogens 



Endogenous Mechanisms 

Chronic Inflammation 

Limited Enzyme Fidelity 
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Environmental Toxicology: Carcinogenesis. Figure 1 

A schematic representation of the accumulation of oncogenic mutations and the influences of exogenous agents and 
endogenous mechanisms on this process. Each time a cell completes the cell cycle and divides, there is a good chance 
that one or more mutations will occur. Fortunately, most of these DNA mutations are not oncogenic. However, the 
probability of an initiated cell or a cell with two or more oncogenic mutations, picking up another oncogenic mutation 
increases as clonal expansion occurs. A tumor develops once a sufficient number of oncogenic mutations have 
accumulated to bypass all of the protective barriers of the normal human cell 


These different mechanistic impacts on cells encourage 
the formation of a variety of mutations, generally clas¬ 
sified as single base mutations, chromosomal aberra¬ 
tions, insertions and deletions (indels), and epigenetic 
mutations. 

Single Base Mutations 

Single base mutations encompassing single base sub¬ 
stitutions and single base indels represent a significant 
proportion of cancer-associated genetic errors. It is 
generally accepted that the more transformed a cell 
becomes, the more likely a single base substitution 
will occur in an activating site of a proto-oncogene or 


in deactivating site in a tumor suppressor gene. Most of 
the alkylating and arylating carcinogens listed in Table 1 
are known to produce single base mutations, especially 
single base substitution mutations. Oxidation and 
the production of ROS and reactive nitrogen species 
(RNS), common with inflammatory processes, are also 
well known to produce single base mutations [25, 43]. 
The vast majority of errors produced by DNA poly¬ 
merases and the DNA replication machinery are single 
base mutations [31, 32]. 

Single base substitution (SBS) mutations represent 
85% or more of the activating and deactivating muta¬ 
tions in proto-oncogenes and tumor suppressor genes, 
respectively, in human tumors [44-48]. Although 
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Environmental Toxicology: Carcinogenesis. Table 2 Genotoxic and carcinogenic mechanisms 


Mode of action 

Effects 

Mutations 

Instability 

1. Direct DNA damage (covalent 
alterations) 

Base pair mismatch 

Single base mutation (SBS and 
Indels) 

NSI 

Depurination and 
depyrimidination 

Indels (larger than one base) 
Epigenetic mutations 

EIS 

Single strand break 

Chromosomal aberrations 
(deletions, insertions, 
translocations) 

CIN 

Double strand break 

2. Interference with DNA 
polymerases and replication 
machinery 

Base pair mismatch 

Single base mutation (SBS and 
Indels) 

NSI 

Microsatellite slippage 

Microsatellite length changes 

MSI 

Perturbation of epigenetic 
patterns 

Epigenetic mutations 

EIS 

3. Interference with 
cytoskeleton and mitotic spindle 
Mitotic proteins and enzymes 

Nondisjunction 

Aneuploidy 

CIN 

4. Interference with DNA repair 
systems 

Faulty mismatch repair 

Microsatellite length changes 

MSI 

Improperly or unrepaired 

Single base mutation Indels 

NSI 

Single and double strand 
breaks 

Chromosomal aberrations 
(translocation, inversions, deletions, 
etc.) 

CIN 

5. Interference with maintenance 
of telomeres 

Critically shortened 
telomeres 

Dicentric chromosomes aneuploidy 

CIN 

6. Cytotoxicity 

Increased cell turnover 

(Most types of mutations) 

Any possible 
(NIS, MSI, CIN, 

EIS) 

Increased opportunity 
for accumulating 
mutations 

7. Stimulate cellular proliferation 
(promotion) 

Clonal expansion 

(Most types of mutations) 

Any possible 
(NIS, MSI, CIN 

EIS) 

Increased probability 
for accumulating 
mutations 


CIN chromosomal instability, EIS epigenetic instability, MSI microsatellite instability, NIS nucleotide instability (also known as single base 
instability) 


single base deletions and insertions are sometimes 
involved, SBSs represent the vast majority of these 
oncogenic errors [47, 49, 50]. This has been clearly 
demonstrated in human tissues for a number of 
proto-oncogenes and tumor suppressor genes, for 
example, Braf, H-, K-, and N-Ras, RB and TP53 
[45-47, 49-51]. Missense SBS mutations alone com¬ 
prise more than 75% of all identified disease-associated 
TP53 mutations [49]. 


Chromosomal Aberrations 

Cancer-associated chromosomal errors have been 
known since the middle of the last century. Cytogenetic 
studies identified numerous leukemia-specific (and 
lymphoma-specific) chrosomal translocations [52]. 
The best-known example is the chronic myeloblastic 
leukemia (CML)-specific chromosomal translocation 
between chromosomes 9 and 22 [t(9;22)(q34:ql 1)], 
which forms the chimeric bcr-abl gene. Chromosomal 
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errors are prominent in many types of human tumors 
and most likely crucial to the development of at least 
some leukemias and lymphomas [52]. Aneuploidy is 
the most frequently identified mutation in human can¬ 
cers, albeit cytogenetic analysis is routine medical care 
for cancers. 

Disruption of cellular cytoskeleton, mitotic spindle 
apparatus, and/or enzymes involved in the processes of 
mitosis commonly lead to chromosomal errors. For 
example, asbestos fibers spear cells and bind proteins 
that are involved in mitosis, leading to non-disjunction 
and other chromosomal aberrations [53]. Polycyclic 
aromatic hydrocarbons (PAHs) also produce chromo¬ 
somal errors by causing the rapid shortening of the 
telomere length [54]. Tobacco smoke constituents 
play a prominent role in the chromosomal aberrations 
[55-57]. 

Indel Mutations 

Chemical and physical carcinogens are well known to 
cause single and double strand breaks in DNA. Some 
DNA adducts, such as the alkyl-N7-deoxyguanosine 
formed by alkylation (or aryl-N7-deoxyguanosine by 
arylation) may lead to a depurinic site, which weakens 
the strand and easily leads to a DNA strand break. The 
formation of the covalent adduction on the N7-position 
of purine residues in DNA destabilizes the guanidine-N9- 
l'-deoxyribose glycosidic linkage [58-60]. Single and 
double strand breaks also result from ionizing radia¬ 
tion. Two DNA strand breaks some distance apart can 
lead to deletions, the size of which is dependent upon 
the separation distance of the breaks. These carcino¬ 
gen-initiated deletions may disrupt oncogenes such as 
RB or TP53, produce loss of heterozygosity (LOH), or 
result in chromosomal translocations [39, 45, 49]. 
Large deletions (and insertions) of one million bases 
or more are detectable by cytogenetics and will fall into 
the chromosomal aberration category. 

The explanation for the formation of insertions is 
less clear, but may entail the same basic mechanisms as 
noted above for deletions, or by retrotransposition of 
viral or repeat sequences. The highly repeated Alu 
sequence, of approximately 300 nucleotides in length, 
is present in over a million copies in the human 
genome [61, 62] . Every so often, an Alu repeat sequence 
inserts itself in a new section of the DNA. This has been 


observed as a major oncogenic mutation in retinoblas¬ 
toma and numerous genetic diseases [63]. However, 
these events are rare and retrotransposition is probably 
not a prominent mechanism in cancer development. 

Epigenetic Mutations 

Another form of DNA mutation that has been demon¬ 
strated in recent years to be crucially involved in cancer 
development is the alteration of patterns of DNA 
5-methyldeoxycytidine (5mdC) and/or changes in DNA 
chromatin modification. This type of mutation is referred 
to as an “epigenetic mutation,” since these changes do not 
alter the basic DNA coding sequence. To clarify firstly 
DNA 5mdC patterns, selective deoxycytidine residues in 
DNA are enzymatically modified by endogenous DNA 
methyltransferases (DNMTs) that catalyze the transfer of 
a methyl group from S-adenosylmethionine to the 
5-position on the cytosine base, forming 5mdC [64]. In 
adult somatic cells, this normal DNA modification 
occurs almost exclusively at 5 / -CpG-3 / sequences and 
duplex DNA is maintained in a symmetrically methyl¬ 
ated state, so that the complementary 5 / -CpG-3 / 
sequence is also methylated on the opposing strand 
[65, 66] . It should be noted that in stem cells, especially 
embryonic stem cells, a significant portion of the 5mdC 
also occurs in non-CpG sequences [65, 66]. The mean¬ 
ing of these non-CpG methylated sequences is pres¬ 
ently unknown. Maintenance of symmetrically 
methylated duplex DNA ensures the mitotic heritabil- 
ity of these DNA 5mdC patterns in somatic cells by 
providing the 5mdC pattern template for each newly 
synthesized strand following semiconservative DNA 
synthesis. The level of 5mdC on a specific DNA locus 
or gene promoter region determines whether the 
sequences are expressed or not. Highly methylated 
promoter sequences 5' to the coding sequence of 
a given gene indicate that the gene is not available for 
transcription [67]. Today this is commonly referred to 
as gene hypermethylation, which is identified fre¬ 
quently as an epigenetic mutation in one or more 
tumor suppressor genes in human cancers [67, 68]. 
Hypomethylation of these same promoter sequences 
opens the gene sequences up for transcription, but 
does not indicate that the gene is being transcribed 
since additional DNA binding proteins and transcrip¬ 
tion factors are required for expression of the gene. 
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Not surprisingly, DNA 5mdC patterns change during 
tumor development [69, 70], and chemical carcinogens 
are known to alter DNA 5mdC patterns [71-75]. 

Chromatin modifications are more complex than 
DNA 5mdC patterns and include a number of different 
molecular moieties. The basic repeating unit of chroma¬ 
tin is the nucleosome, which is composed of 147 base 
pairs of DNA wrapped around a core containing two 
copies each of four core histone proteins H2A, H2B, 
H3, and H4 [76, 77]. Various molecular modifications 
of the N-terminal tails of these core histones occur in 
mammalian cells. In 1964, Allfrey et al. [78] first 
reported histone N-terminal tail modification, acetyla¬ 
tion, but it was many years later before the complexity 
and importance of this posttranslational modification 
became known. As it turns out, multiple histone 
modifications occur, consisting of methylation 
[79-81], acetylation [82], phosphorylation [83], 
ubiquitinylation [84], and sumoylation [85]. The pat¬ 
terns and combinations of these modifications are 
referred to as the histone code [86], which influences 
the chromatin structure and remodeling for either 
active expression or silencing of genes [87] . Euchroma- 
tin (active chromatin) along with active gene expres¬ 
sion was associated with methylation of the lysine 
residue in the fourth position of the N-terminal tail 
of histone H3 (denoted as H3K4me), while H3K9me 
and K3K27me formation were linked to heterochro¬ 
matin (silent chromatin) and repression of gene 
expression [79-81]. These patterns become even more 
complex as each of these histone lysine residues may be 
mono-, di-,or tri-methylated (i.e., H3K4me 1 , 
H3K4me 2 , and H3K4me 3 ) [88-91]. The importance 
of multiple methyl groups on lysine residues is pres¬ 
ently only poorly understood. Histone methylation also 
takes place on other lysine residues on histone H3 K36 
and K79, and histone H4 K20, and on arginine residues 
histone H3 R2, R17, and R26, and histone H4 R3. 

Acetylation of the positively charged lysine residues 
decreases the effective ionic charge, reducing the affin¬ 
ity to the negatively charged phosphate backbone of 
DNA and creates a binding side for regulatory DNA 
proteins that activate transcription [92]. Essentially, 
histone acetylation alters the chromatin structure to 
a more transcriptionally supportive architecture. Acet¬ 
ylation of the amino group of lysine residues occurs on 
histone H2A K5 and K9, histone H2B K5, Kll, K12, 


K15 and K20, histone H3 K9, K14, K18, K23 and K56, 
and histone H4 K5, K8, K12 and K16 [86]. Acetylation 
patterns are established by highly regulated histone 
acetyltransferases (HATs) and histone de-acetylases 
(HDACs) [93]. HAT enzymes are grouped into differ¬ 
ent family classes, GNAT family, MYST family, and 
CBP/p300 family [94]. Some transcription factors 
such as yeast protein Gcn5, a putative transcriptional 
adaptor, are HAT enzymes [95]. HDACs catalyze the 
removal of acetyl groups from histones by activation of 
a water molecule through a zinc ion in the active site 
[96, 97], and are involved in formation of heterochro¬ 
matin and the silencing of genes. In human cells, 18 
nonredundant HDACs have been identified [98], 
which have been divided into classes I, II, and III 
[99]. Class I HDACs are localized within the nucleus 
while Class II shuttle between the nucleus and cyto¬ 
plasm [98]. Regulation of HDACs consists of protein- 
protein interactions, posttranslational modifications, 
cellular localization, and metabolic cofactors [100]. 

In 1967, Gutierrez and Hnilica [101] first reported 
histone phosphorylation of serine residues on H3S10 
and H3S28. Histone phosphorylation appears to be 
involved in cell cycle chromatin condensation, active 
transcription, apoptosis, and DNA repair mechanisms 
[102, 103]. Histone H3T11 and H3S28 have been iden¬ 
tified as mitosis-specific sites for phosphorylation [ 104, 
105]. Histone phosphorylation is also involved with 
activation of early response genes such as c-fos, 
c-myc, and c-jun. This is a stimulant-dependent H3 
phosphorylation, resulting in quick transcriptional 
activation of a subset of immediate-early genes [106]. 
Although, acetylation and phosphorylation are both 
linked to transcription activation, it is unknown 
whether the two modifications work independently, 
in parallel, or synergistically [107, 108]. Phosphoryla¬ 
tion also occurs on serine and threonine residues on 
histone H2A SI, histone H2B S14, and histone H4 SI. 

Histone ubiquitinylation and sumoylation are the 
least understood of the chromatin modifications 
[84, 85]. However, these modifications maybe impor¬ 
tant in DNA damage response and repair, V(D)J recom¬ 
bination, meiotic and spermatogenesis remodeling, and 
chromosomal stability [109-114]. 

It is becoming clear that histones are major carriers 
of epigenetic information. Histone modifications pro¬ 
vide mechanisms to generate and stabilize chromatin 
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structures. The study of chromatin modification pat¬ 
terns is a very young field, which will undoubtedly 
become very important to the delineation of carcino¬ 
genesis. In fact, evidence is already beginning to accu¬ 
mulate regarding alterations in chromatin histone 
modifications due to exogenous chemical carcinogen 
exposures [75, 115]. 

Cytotoxicity and Cellular Turnover 

Most exogenous carcinogens are cytotoxic at recog¬ 
nized carcinogenic doses, and probably at much lower 
doses to impacted cells. This concept is easy to envision 
when considering agents that directly damage cellular 
macromolecules (i.e., ionizing radiation, alkylating and 
arylating carcinogens). Interference with cellular func¬ 
tions, enzymatic processes, cytoskeletal structure, 
mitochondrial electron transport and oxidative phos¬ 
phorylation, etc. may lead to cell senescence or death. If 
the DNA is damaged to the extent that successful repair 
is doubtful, the cell undergoes apoptosis as mediated by 
TP53 [39, 116]. Both senescence and apoptotic path¬ 
ways are protective means by which cells may avoid 
accumulating DNA mutations [33]. 

Concentrations of chemical carcinogens and doses 
of physical carcinogens that produce mutagenicity in 
experimental studies generally also produce 
a significant percentage of cell lethality [117]. Whether 
cell death is the result of cellular damage inflicted 
directly by the carcinogen or from triggering apoptosis, 
the loss of the cell leaves a gap to be filled by cellular 
turnover. In tissue, this leads to increased cell division 
to replace these lost cells. This damage-induced cell 
proliferation offers increased opportunities for the 
accumulation of mutations during DNA synthesis 
and mitotic division. Cells adjacent to the vacated 
space will have the chance to divide to replace the 
missing cell. One or more of these adjacent cells 
might harbor a mutation that confers a growth advan¬ 
tage enabling it to dominate over its neighbors and 
progress through cell division. Thus, the potential for 
clonal expansion and the continued accumulation of 
oncogenic mutations is encouraged by the impact of 
carcinogens on tissues. 

The opportunity or encouragement of carcinogen 
exposed cells to undergo cell division provides yet 
another enhancement for mutation occurrence. Many of 


the cells that have the opportunity to divide have also 
received a carcinogen-induced damaging event, which if 
not repaired prior to DNA synthesis may lead to a fixed 
mutation [117]. For example, the presence of an alkyl- 
0 6 dG adduct during DNA synthesis will result in the 
formation of a G to T mismatch approximately 50% of 
the time. Alkyl-0 6 dG base pairs equally efficiently with 
either a thymidine or a cytidine residue [60, 118, 119]. 
Different DNA adducts and forms of damage will result 
in different mutational events following DNA synthe¬ 
sis. But the cytotoxic action of these carcinogens cou¬ 
ples with the DNA damaging activity to induce 
mutations. 

When carcinogen induced cell death involves 
a tissue stem cell, the dead cell is replaced by another 
stem cell. Most of cancer arises from tissue stem cells. 
The accumulation of mutations in a transit-amplifying 
progenitor cell will usually result in no consequence as 
the developing cancer cell will be terminally lost either 
by terminal differentiation and cell death or by being 
sloughed off the epithelial layer [34]. Stem cells have 
the capability of surviving long enough to accumulate 
the necessary mutations to become cancerous. Even in 
rapidly turning over tissues, stem cells appear to be 
long lived, with estimated lifetime division numbers 
totaling in the hundreds and higher [120]. But these 
stem cells also enjoy a slow or delayed cell cycle, divid¬ 
ing intermittently (only once every 7 or 8 days), and 
often sitting quiescent and in a nonproliferative or 
reserve stem cell state [121-123]. Stem cell division is 
generally considered to be asymmetric, giving rise to 
both a replacement stem cell and a progenitor cell that 
will undergo numerous symmetrical cell divisions and 
then terminally differentiate [120, 124-126]. This 
asymmetric cell division appears to also include the 
retention of the original DNA strand by the daughter 
stem cell replacement. Or stated differently, the original 
template DNA strand of the mother stem cell selectively 
segregates to the daughter stem cell replacement, while 
the newer DNA strand segregates to the transit- 
amplifying daughter cell. This is the Cairns’ “immortal 
stranding” hypothesis [127], which appears to have 
experimental validity [124, 126]. DNA synthesis in 
human cells accumulates one to three mutations 
every time the genome is copied [31, 32]. “Immortal 
stranding” enables the stem cell to minimize the accu¬ 
mulation of DNA mutations, and thus limit the 
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number of mutations that are passed on to the transit- 
amplifying differentiating epithelial cells. 

Adult stem cells are very sensitive to DNA damage, 
and rather than perform DNA repair, stem cells appear to 
undergo apoptosis in response to external exposure to 
DNA damaging agents [128]. Discarding the cell rather 
than risk faulty repair ensures that the accumulation of 
DNA mutations in stem cells does not occur. However, 
the loss of one or more stem cells due to external toxic 
exposure(s) (UV light, ionizing radiation, arylating or 
alkylating chemicals, or inflammation or infection pro¬ 
duced ROS), requires the replacement of these stem 
cells. Stem cell replacement requires a symmetrical cell 
division to give rise to two daughter stem cells, which 
will produce one “aged” DNA template strand regard¬ 
less of “immortal stranding.” This process increases the 
number of mutations arising in the stem cells and 
increases the risk of disease. Alternatively, a transit- 
amplifying cell may undergo dedifferentiation, charac¬ 
teristic of mesenchymal-epithelial transition [129]. 
Unfortunately, this dedifferentiation process again pro¬ 
duces a stem cell without the benefit of “immortal 
stranding” and increases the risk of mutations and 
disease developing in the tissue stem cell population. 

Genomic Instability 

Genomic instability is the loss of genomic stability and 
the increased probability for accruing DNA mutations. 
Restated, genomic instability is the inability of cells to 
maintain the integrity of their own genomic inheritance. 

It is generally accepted that most cancers express one 
or more forms of genomic instability. However, among 
the scientific community there is still controversy as to 
whether the loss of genomic stability is required for the 
development of cancer. One argument states that geno¬ 
mic instability is not necessary for the accrual of 
a sufficient number of oncogenic mutations to reach 
a malignancy [130, 131]. The other argument states 
that human cells would not have the opportunity to 
accumulate a sufficient allotment of driver mutations to 
achieve a cancerous cell without an additional navigating 
mechanism(s) [132]. The statistical probability during 
a normal human life span has been proposed to be too 
low based on theoretical cell division numbers alone. 
However, many oncogenic mutations, such as in “care¬ 
taker” genes, result in the enhanced probability of 


initiated cells achieving more mutations and progressing 
along the road of tumor development [31, 40, 42, 133]. 
Caretakers are genes that code for cellular proteins and 
factors that protect the genome from mutation. Dis¬ 
ruption of the finely balanced cellular network of mon¬ 
itoring and repair systems will lead to a loss of fidelity 
of these genome protection systems, increases in mac- 
romolecular damage, increased longevity of DNA dam¬ 
age, faulty repair, and an increased accumulation of 
DNA mutations. For example, a mutation that disrupts 
the normal function of a mismatch DNA repair gene 
leads to upward of a 100-fold increased rate of cellular 
DNA mutation [40]. 

Genomic instability is a hallmark of cancer and one 
of the trademarks of a number of inherited genetic 
diseases that predispose individuals to cancer [131] 
(see Table 1). By today’s laboratory techniques, geno¬ 
mic instability manifests as chromatin instability 
(CIN), microsatellite instability (MSI) (sometimes 
referred to as MIN), nucleotide instability (NIS) (also 
called single base instability), and epigenetic instability 
(EIS) (see Table 2). 

Chromosomal Instability (CIN) 

The inability of the cell to properly repair single or 
double strand DNA breaks, perform appropriate 
recombination activities in somatic cells, or correctly 
perform the mitotic spindle functions during mitosis 
will lead to the development and accumulation of 
chromosomal aberrations, which is the manifestation 
of CIN [33, 134]. Cells expressing CIN miss-segregate 
chromosomes 10-100 times more frequently than 
chromosomally stable diploid cancer cells [135]. CIN 
may promote tumor evolution by enabling the clonal 
expansion of cells with proliferative advantages and/or 
metastatic potential [136-138]. 

Chemical and physical carcinogens can interfere 
with the cytoskeleton, mitotic spindle apparatus, or 
the various proteins and enzymes involved in mitosis. 
The impact of such carcinogenic damage is the 
increased probability of chromosomal mutations. As 
noted above, asbestos produces aneuploidy in skewered 
cells by interfering with the cytoskeleton and mitotic 
proteins [53]. Tobacco smoke PAHs, BPDE, and 
4-aminobiphenyl enhance telomere shortening and 
produce CIN in vivo and in vitro studies [54-57]. 
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Microsatellite Instability (MSI) 


Nucleotide Instability (NIS) 


Microsatellites are relatively short sequences composed of 
tandemly repeated units of one to six bases in length 
[139]. The repeated unit of sequence may be only 
a mono-nucleotide repeat (e.g., AAAAAAA...), di¬ 
nucleotide repeat (e.g., C AC AC AC AC AC AC A...), tri¬ 
nucleotide repeat, etc., although microsatellites of 
imperfect and compound sequences also exist in the 
human genome (e.g., Bat25 and Bat26) [140]. The most 
common class of microsatellites in the human genome 
are composed of various tandemly repeated lengths of 
(CA) n . Microsatellites are widely prevalent throughout 
the human genome, occurring more frequently in non¬ 
coding regions on all chromosomes, and act as gene 
promoters, as target sites for recombination, and as 
binding sites for DNA topoisomerases [139]. The 
length of a specific microsatellite is an inherited trait 
and therefore the same in all tissues. Insertion or dele¬ 
tion of one or more repeat units within a microsatellite 
occurs on rare occasion by slippage during DNA repli¬ 
cation, resulting in a longer or shorter microsatellite 
sequence, respectively. These polymerase slippage 
errors are normally repaired by a post-replication 
mismatch repair system, and the presence of 
a microsatellite variant generally results from the lack 
of fidelity in the mismatch repair mechanism [ 141-143]. 
MSI is denoted by the presence of microsatellite vari¬ 
ants of different lengths in tumors compared to normal 
tissue, and is a hallmark of mismatch repair deficient 
familial and sporadic cancers [144, 145]. 

In theory, a carcinogenic agent that interferes with 
DNA polymerases and/or the mismatch repair system 
would lead to MSI. For many types of cancer, MSI is 
the one genomic instability that is consistently acquired 
late in the tumorigenic process [139]. The major 
exception to this rule is the hereditary nonpolyposis 
colorectal cancer (HNPCC) syndrome, for which indi¬ 
viduals inherit an error in the mismatch repair system 
and are at risk of developing colon cancer [146, 147]. 
However, airborne particulates (PM 2 . 5 ) have been asso¬ 
ciated with the development of MSI in human lung 
epithelial cells [148, 149]. Cancers induced by betel- 
quid chewing and chronic occupational exposure to 
chromium have been found to express MSI signifi¬ 
cantly more frequently than noncarcinogen-associated 
tumors [150, 151]. 


Nucleotide instability may be one of the more easily 
induced instabilities by either exogenous or endogenous 
influences. Increases in the frequencies of various forms 
of DNA nucleotide damage and/or loss of cellular DNA 
damage surveillance and repair capacity, or the pertur¬ 
bation of the DNA polymerases or synthesis complex 
machinery result in increases in single base mutations 
[19, 46-48, 132]. With the possible exception of chro¬ 
mosomal translocations in leukemias and lymphomas, 
NIS is probably the most common genomic instability 
expressed in human cancer [47, 49, 50] . NIS may also be 
predominant in the earliest mechanisms of carcinogen¬ 
esis as roughly 85% of the activating and deactivating 
mutations in proto-oncogenes and tumor suppressor 
genes, respectively, are single base substitution muta¬ 
tions [44-48]. Much of the NIS data in the literature 
has been accumulated from clonal expansion of muta¬ 
tions in tumor tissue, that is, TP53 mutation database 
and retinoblastoma gene mutations [49, 50]. The anal¬ 
ysis of NIS has been hampered by the requirement of 
exceedingly sensitive techniques, and the limited viable 
ultrasensitive approaches are labor intensive [47]. How¬ 
ever, with the present availability of the ultra-sensitive 
methods, the presence of NIS in non-tumor tissues and 
the induction of NIS by exogenous environmental car¬ 
cinogens are beginning to unfold [48, 152]. 

Epigenetic Instability (EIS) 

Although any alteration from normal of DNA 5mdC 
and/or chromatin modification patterns constitutes 
epigenetic instability, the majority of the scientific lit¬ 
erature has focused on hypermethylation, that is, the 
transcriptional suppression of important anticancer 
genes by high levels of 5mdC present in the gene 
promoter sequences [67, 68]. In human cancer, a 
hypermethylation phenotype, referred to as CIMP 
(CpG island methylator phenotype), has been pro¬ 
posed [153]. The CIMP phenotype may have clinical 
significance, but the cause or mechanism underlying 
the CIMP tumor phenotype has yet to be identified 
[154-156]. Regardless of the gene specific 
hypermethylation ascertainment biased approach, can¬ 
cer cells are globally hypomethylated and harbor 
20-60% less genomic 5mdC than their normal coun¬ 
terpart. The hypomethylated sequences in cancer cells 
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may be mainly due to the loss of 5mdC levels in repet¬ 
itive sequences, which are normally heavily methylated 
and comprise 20-30% of the human genome [157]. 
For at least some cancers, the level of global 
hypomethylation is associated with progression, 
becoming progressively more hypomethylated from 
normal to benign, to tumor and metastasis [158— 
160]. Indeed, transient global hypomethylation is 
inducible by either chemical carcinogens or hypoxia 
[71, 72, 161]. Interestingly, increased NO production 
that may occur in inflammatory disease, or in response 
to hypoxia, reduces S-adenosylmethionine levels by 
inactivating methionine adenosyltransferase, and pro¬ 
duces hypomethylation [162]. Dietary deficiencies in 
either folic acid or methyl donor compounds, that is, 
methionine, also cause hypomethylation [6-8]. 
Hypomethylation of gene-specific sequences in cancer 
cells leads to ectopic gene expression, such as neuron- 
specific y-synuclein expression in gastric cancers [ 163] . 
The degree of hypomethylation of the XIST gene cor¬ 
relates with prostate cancer aggressiveness [164]. 

There may be several mechanisms at work in the 
alterations of epigenetic signals during carcinogenesis 
as these patterns are intimately involved in all biolog¬ 
ical processes, that is, genomic imprinting, gametogen- 
esis, embryogenesis, differentiation, aging, and 
carcinogenesis. As noted above, dietary deficits are 
known to alter DNA 5mdC levels and patterns [6-8]. 
Alkylation, arylation and other forms of DNA damage 
induced by carcinogens have been clearly shown to 
disrupt DNA 5mdC patterns [71, 72]. DNA epigenetic 
patterns may also be impacted by many other environ¬ 
mental factors, including tobacco use, drug use, phys¬ 
ical exercise, and health status as DNA 5mdC patterns 
diverge with age in monozygotic twin pairs [165]. 

Chromatin modification patterns are well known to 
be involved in carcinogenesis and the development of 
cancer [166,167]. However, this field is still developing, 
and the data regarding exogenous agent-induced 
changes in histone modifications are presently limited. 
Several in vitro tissue culture studies have been 
performed that suggest toxicants induced changes in 
selective histone residue methylation or acetylation 
(see review [75]), but the interpretation of these spe¬ 
cific histone modification changes amid the complex 
arrays of epigenetic patterns and extrapolation to in 
vivo will require further analyses. 


To summarize genomic instability, there are a large 
number of genes involved in the maintenance of the 
genome. Many of these genes, such as the DNA repair 
genes, have been referred to as caretaker genes for their 
role in genomic stability, while others such as the RB 
and TP53 genes have been denoted as gatekeeper genes 
due to their involvement in cell cycle control and the 
pathway to cancer development [168]. Each of these 
instabilities enhances the probability of accumulating 
oncogenic mutations and progression along the road 
of tumor development, the acquisition of invasive or 
metastatic capability, and/or resistance to chemothera¬ 
peutic regimens. This last effect has also been observed 
in non-chemotherapeutic carcinogens, whereby cells 
exhibiting selective genomic instabilities are resistant 
to the cytotoxic effects of some carcinogens. Cells 
expressing CIN were found to be resistant to the lethal 
effects of PhIP but not to the methylating carcinogen, 
AT-methyl-N'-nitro-iV-nitrosoquanidine (MNNG), 
while cells expressing MSI were resistant to MNNG 
but not to PhIP [169]. This provides yet another 
important mechanism by which chemical carcinogens 
may select for mutant cells and promote their clonal 
expansion by exerting cytotoxic effects on neighboring 
cells in the tissue. 

Susceptible Individuals 

Based on the above discussion of genomic instabilities, 
it is not surprising that some individuals in the human 
population have inherited genetic errors that render 
them more prone to developing cancer [47, 131] (see 
Table 1). For a portion of these susceptible individuals, 
the risk of cancer is quite high. For example, individ¬ 
uals who have inherited a mutation in one of the 
mismatch repair genes (i.e., MSH2, MLH1, MSH3, 
MSH6, PMS1, and PMS2) are carriers of hereditary 
nonpolyposis colorectal cancer (HNPCC) syndrome 
and are at high risk of colorectal carcinoma [139, 146, 
147] . Inheritance of any one of several DNA repair gene 
mutations confers a genetic disease (i.e., xeroderma 
pigmentosum, ataxia telangiectasia, Fanconi anemia, 
Cockayne syndrome, and others), which is associated 
with adverse clinical symptoms including a substan¬ 
tially elevated risk of cancer. Individuals harboring 
a germ line mutation in the TP53 gene, referred to as 
the guardian of the genome gene, generally present as 
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a Li-Fraumeni Cancer Family Syndrome and are at 
a substantially elevated risk of multiple types of cancers 
during their lifetime [170]. More than 100 Mendelian 
diseases that confer predisposing cancer syndromes 
have been recorded in “McKusick’s Online Mendelian 
Inheritance in Man” (OMIM), each with varying levels 
of cancer risk [171]. 

There are however heritable mutations that convey 
more subtle increases in cancer risk, especially when 
environmental exposures to exogenous carcinogens are 
involved [47,172]. Polymorphisms in genes involved in 
the deactivation and elimination of toxic metabolites 
and carcinogens may confer to the individual 
a decreased ability to remove offending toxicants and/ 
or reactive electrophiles before they can cause DNA 
damage, which provides a slightly increased probability 
of developing cancer. Examples of these susceptibility 
traits are found in polymorphisms among the phase 
I and phase II biotransformation enzymes, such as 
cytochrome P450 and glutathione-S-transferase genes 
[ 172-174] . These individuals are at an increased risk of 
cancer from exposures to polycyclic aromatic hydro¬ 
carbon (PAH) carcinogens commonly found in 
tobacco smoke and other combustion products [173, 
175]. Depending upon the metabolic pathway used to 
activate, deactivate, and/or eliminate carcinogens, gene 
polymorphisms may render individuals more suscep¬ 
tible or less susceptible to cancer from exposure to 
selective carcinogenic agents. 

Dose and Complex Dose 

Humans are exposed to a plethora of toxic and carci¬ 
nogenic materials in the environment. Exposure to 
complex toxic mixtures poses a higher risk of detri¬ 
mental health effects than that represented by the 
toxicity of each separate individual constituent. 
When an individual or animal is exposed to 
a complex mixture of carcinogenic agents, the dose 
required to produce significant health problems and 
tumor development will be lower than that for each 
separate individual constituent. In toxicology, it is 
well recognized that exposures to significant levels of 
complex mixtures of toxic compounds results in 
increased detrimental health effects. This is also true 
for carcinogenic agents. The combined toxic effect of 
separate toxicants is generally predicted to be at 


a minimum additive, unless specific information 
exists to demonstrate a synergistic (above additive) 
or inhibitory (below additive) action. Many of these 
carcinogens target the same organs or tissues, and 
induce similar toxic effects in humans and animals. 
For example, alcohol use, foodborne aflatoxins and 
hepatitis B (or C) all adversely impact the liver and 
promote the development of hepatocellular carci¬ 
noma. Similarily, asbestos, arsenic, and tobacco 
smoke drive the development of lung cancer. For 
each of these examples, some differences in agent- 
specific direct DNA damage may exist, but similar 
mechanisms of cellular damage, enhanced cellular 
turnover, inflammatory responses, and ROS and 
RNS species involvement contribute to the develop¬ 
ment of cancer. 

Although people are not generally exposed to an 
acute carcinogenic dose for each one of these environ¬ 
mental carcinogens, they may receive an accumulated 
dose sufficient to induce cancer due to long-term 
chronic low dose exposure. Damage to the skin from 
lifetime exposure to UV radiation from sunlight is one 
of the more obvious examples. However, the same may 
be true for a number of environmental chemical car¬ 
cinogens, especially if one has been chronically exposed 
to tobacco smoke, whether as a smoker or nonsmoker. 
What is known regarding what constitutes 
a carcinogenic dose is based mostly on laboratory stud¬ 
ies involving a single agent, not a complex mixture. It 
should note, however, that tobacco smoke is a complex 
mixture, and epidemiological studies have provided 
estimates of carcinogenic risk based on the number of 
cigarettes smoked per day and the number of pack years 
of smoking [176, 177]. Estimates of carcinogenic risk 
are generally derived from the carcinogenic dose of 
individual agents. The combined effects of exposures 
to multiple environmental carcinogens must certainly 
lower the carcinogenic dose threshold for individual 
carcinogens. Thus the chronic long-term low-dose 
exposure to a multitude of carcinogenic agents in our 
food, air, water, from sunlight, x-rays, and biological 
pathogens, enhances the risk of cancer as our bodies 
reach old age. Whether cancer is inherent with life or 
not, moderating one’s exposure by avoiding habitual 
exposures, that is, tobacco smoke and alcohol abuse, 
and maintaining a healthy lifestyle significantly lowers 
one’s lifetime risk of cancer. 
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Environmental Carcinogens 

To reiterate, the rate of tumor development is increased 
sometimes many orders of magnitude above the spon¬ 
taneous background rate by exposure to exogenous 
carcinogens. The timing of the onset of cancer is due 
to the dose and duration of carcinogen exposure and 
not to either the chronological age or the age that 
exposure began [38, 39]. Regardless of whether 
a cancer develops from carcinogen exposure(s) or 
endogenous processes, or the combination of exoge¬ 
nous and endogenous attacks (the most likely explana¬ 
tion), the accumulation of oncogenic driver mutations 
and a large number of cell divisions are required. 

Separating the effects of exogenous versus endog¬ 
enous genotoxic damage and cancer induction may 
not be an easy undertaking. More than likely, a tumor 
originating only from exogenous carcinogen exposure 
is rare if nonexistent. However, the impact of exoge¬ 
nous exposures to genotoxic agents is easily demon¬ 
strated by the significant increases in lung cancer of 
smokers [176], and by the patterns of oncogenic single 
base substitution mutations in the TP53 gene in lung 
cancers and breast carcinomas from smokers. Lung 
and breast tumors arising in smokers are twice and 
ten times, respectively, as likely to harbor a TP53 gene 
driver mutation of the G to T transversion type than 
similar tumors arising in nonsmokers [178, 179]. 
Benzo(a)pyrene and other PAHs are known to target 
deoxyguanosine in the most commonly mutated sites 
(codons 157, 248, and 273) observed in lung tumors, 
and to produce G to T transversion mutations in these 
same base sites [180-182]. Similar findings have 
reported for aflatoxin B1 adduction and 
G to T mutation of codon 249 of TP53 [183-185]. 
Sunlight (UV light) produces signature CC to TT 
mutations in TP53 and other DNA sequences that 
have been identified in sun exposed skin, which is 
well correlated with the occurrence of skin cancers 
[186-188]. Unfortunately, few carcinogens presently 
provide such clearly apparent tumor-associated 
agent-induced genotoxic damage. Signature diseases 
that aid in identifying the exogenous impact of toxi¬ 
cant exposure are also limited to only a few carcino¬ 
gens, for which asbestos and mesothelioma are the 
best known examples. Many other cancers have 


multiple etiological causes, and delineation of specific 
carcinogen exposures in the causation is not easily 
identified (see Table 1). 

The following are brief descriptions of the 
genotoxic processes and carcinogenic mechanisms for 
a few examples of common environmental carcino¬ 
gens, which are provided here to clarify the discussions 
of the impact of these exposures on the development of 
human cancer. The discussions are not intended to be 
fully comprehensive. Extensive literature is available of 
each of these agents and numerous references are pro¬ 
vided for the reader. 

Alcohol 

Chronic alcohol consumption is associated with oral 
cavity, pharynx, larynx, esophageal, liver, and colorectal 
cancer, and breast cancer in women [12, 13, 16]. Alcohol 
use is also suspected to be involved in the etiology of 
pancreatic and lung cancers. The development of alcohol- 
induced cirrhosis of the liver is a well-known etiology for 
hepatocellular carcinoma development [ 16] , and the risk 
of hepatocellular carcinoma increases with the syner¬ 
gistic impact of tobacco smoke and alcohol, or of 
hepatitis (B or C) and alcohol [12]. However, pure 
ethanol itself is not genotoxic, but its primary metab¬ 
olite acetaldehyde is mutagenic [189, 190]. 

The present theories regarding the carcinogenic 
mechanisms of alcohol are based on DNA damaging 
events, tissue inflammation and perturbation of nutri¬ 
tion and physiological functions [13, 14]. Acetaldehyde 
forms DNA adducts [189, 190], and damages hepato- 
cytes and tissue architecture, which leads to increased 
cellular proliferation [191]. Alcohol induces CYP2E1, 
whose gene product (P450 2E1) adds to the metabo¬ 
lism of alcohol to acetaldehyde as well as activating 
tobacco carcinogens (i.e., nitrosamines) [12]. Alcohol 
also induces ROS & RNS [192], which leads to lipid 
peroxidation and further DNA damage from reactive 
electrophilic products of the lipid peroxidation reac¬ 
tions [193]. Alcohol intake interferes with folate 
metabolism [194] and folate deficiency increases the 
risk of colorectal carcinoma by two to five times [195]. 
Alcohol may also alter the absorption and metabolism 
of vitamins, such as vitamin B 12 , B 6 , and retinoic 
acid, and often results in malnourishment in the 
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alcoholic [13, 14, 194]. Thus chronic alcohol use uti¬ 
lizes at least three different avenues of carcinogenesis; 
DNA damage and mutation, inflammatory processes 
that promote proliferation, and vitamin deficiencies 
that are known to perturb DNA epigenetic patterns. It 
is not surprising that alcohol works synergistically with 
other carcinogens such as tobacco smoke to increase 
the occurrence of cancers [12]. 

Sunlight 

UV light from the sun is probably the most important 
naturally occurring environmental carcinogen to which 
humans are exposed. Chronic exposure to sunlight and 
repeated sunburns are strongly correlated with 
nonmelanoma skin cancers (mainly squamous cell car¬ 
cinoma and basal cell carcinoma), especially on the 
head, neck, forearms, and hands, which clothing gen¬ 
erally leaves uncovered [196]. Skin cancer is one of the 
more frequent cancers in aging populations, and is the 
most common cancer in the USA [9, 10]. The lifetime 
risk of developing skin cancer in the USA is one in five. 
Fortunately, most skin cancers are amenable to early 
detection and a high cure rate. Melanoma is the excep¬ 
tion, leading to a lethal disease course in greater than 
10% of cases. 

UV light is comprised of electromagnetic radiation 
wavelengths below 400 nm, and has been broken down 
to vacuum UV (100-200 nm), UVC (200-290 nm), 
UVB (290-320 nm), and UVA (320-400 nm). [Note 
that some authors refer to the wavelength ranges for 
UVB and UVA as 280-315 nm and 315-400 nm, 
respectively (e.g., Ridley et al. 2009 [197]).] The 
upper layers of the Earth’s atmosphere filter out much 
of the harmful UV shorter wavelength, although this 
will become less effective and a potentially major health 
issue as the ozone layer thins over parts of the planet. 
The remaining UVA and UVB radiation reaches the 
ground. Both UVA and UVB damage cells and macro- 
molecules in cells. UVB provides the major DNA muta¬ 
tional impact producing signature cyclobutane 
pyrimidine dimers and pyrimidine 6-4 pyrimidone 
photoproducts, which if not repaired, result in signa¬ 
ture CC to TT and C to T mutations, respectively [186, 
187, 198]. Cyclobutane pyrimidine dimers are respon¬ 
sible for approximately 80% of the UV light signature 


mutations, since the 6-4 photoproducts are repaired 
much more efficiently than the cyclobutane pyrimidine 
dimers [188, 197]. Cyclobutane pyrimidine dimers 
are only slowly repaired by a transcription-coupled 
repair system. These UV signature mutations are 
known to occur in at least one important tumor sup¬ 
pressor gene for human cancer, the TP53 gene [186]. 
UV-induced TP53 mutations probably represent an 
early step in the skin cancer carcinogenesis pathway 
for basal cell carcinoma and squamous cell carcinoma 
[186, 199]. 

UVA radiation also contributes to the carcinogenic 
impact of sunlight exposure. UVA radiation leads to 
DNA damage and mutations, albeit at more than 
a 1,000-fold lower extent than UVB. However, UVA 
penetrates more deeply into the skin than UVB and 
produces UV signature mutations in the basal layer, 
while the majority of UVB produces signature muta¬ 
tions in suprabasal keratinocytes [200]. UVA also ini¬ 
tiates substantial cellular macromolecular damage, 
ROS, and lipid peroxidation [197]. 

UV light exposure may enhance the development of 
skin tumors both by inducing mutations and by inter¬ 
fering with the turnover of keratinocytes [201-203]. 
UV irradiation induces apoptosis in severely damaged 
keratinocytes. However, insulin-like growth factor-1 
(IGF-1) produced by dermal fibroblasts opposes UV- 
induced apoptosis in keratinocytes and induces 
replicative senescence by NFk| 3 and TP53 signalling 
[201-204]. UV damaged keratinocytes that survive 
apoptosis still retain the capacity to divide unless held 
in check by IGF-1. Aging skin fibroblasts produce sig¬ 
nificantly less IGF-1, leading to the increased potential 
of damaged and mutant keratinocytes replicating. This 
may explain the increased incidence of nonmelanoma 
skin cancer in geriatric patients [205]. 

The combined effects of UVB and UVA to multiple 
cells in the epidermal layer result in localized tissue 
inflammation, which contributes to the melee of oxi¬ 
dative reactions and includes reactive nitrogen species 
[25, 27, 43]. The setting of cellular damage and inflam¬ 
mation results in a tissue microenvironment that 
encourages cellular turnover, promotes clonal expan¬ 
sion of mutated cells that leads to the accumulation of 
oncogenic mutations and ultimately to skin cancer 
development. 
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Tobacco Smoke 

Tobacco smoke is the most frequent cause of lung 
cancer and probably the most important anthropo¬ 
genic carcinogen of the last century. Whether by life¬ 
style choice of smoking or by exposure to side-stream 
smoke for the nonsmoker, tobacco smoke has been 
a primary etiological factor in lung cancer in the USA 
[176]. On average, lung cancer develops in cigarette 
smokers after 50 pack-years [177]. Tobacco smoke con¬ 
tains a large complex mixture of carcinogens, including 
many that may both initiate and promote the develop¬ 
ment of cancer. Cigarette smoke contains between 
3,000 and 4,000 toxic compounds, more than 300 of 
which are known or suspected carcinogens, including 
the majority of the IARC Group I chemical carcinogens 
discussed in this chapter [206] (see Table 1). Many of 
these toxic and carcinogenic components of tobacco 
smoke are readily absorbed systemically, and smoking 
is well associated with diseases such as cardiovascular 
disease and cancers in addition to lung [176, 177, 207]. 
Tobacco smoking is believed to be an important etio¬ 
logical factor in many cases of oral cancers, esophageal, 
bladder, pancreatic, kidney, and possibly breast and 
cervical cancers [208]. 

Many of the carcinogens in tobacco smoke alkylate 
or arylate DNA, producing promutagenic lesions. For 
example, benzo(a)pyrene is activated by the host 
cells’ cytochrome P450 1A1 enzyme activity to the 
ultimate carcinogen, benzo(a)pyrene-7,8-dihydrodiol- 
9,10-epoxide (BPDE) form, which reacts with nucleo¬ 
philes including the exocyclic amino group on 
deoxyguanine of DNA [209]. The BPDE-N 2 dG adduct 
is a promutagenic lesion that leads most frequently to 
a G to T transversion mutation. Numerous other 
tobacco smoke carcinogenic PAHs, heterocyclic 
amines, and nitrosamines, produce DNA adducts 
that can lead to primarily G to T transversions. 
As noted above, G to T transversions are the 
most common single base mutations in lung 
tumors [178, 179]. A few of these carcinogenic 
nitrosamines are tobacco smoke specific, that is, 
4-(methylnitrosamino)-1 -(3-pridyl)-1 -butanone (NNK) 
and N'-nitrosonornicotine (NNN) [207, 210] . The list of 
carcinogens in tobacco smoke is extensive and ensures 
multiple types of DNA damage and adverse impacts on 


other cellular macromolecules, enzymes, and structural 
components is complex [208]. It has been estimated 
that a developing preneoplastic clone of lung cells 
accumulates one DNA mutation for every 15 cigarettes 
smoked [211]. In addition to damaging DNA, most of 
these tobacco smoke toxicants also promote cellular 
turnover and inflammatory processes in lung tissue, 
providing another avenue of ROS and RNS species 
and altered microenvironments for clonal expansion 
of mutant cells [25, 43]. Particulates from tobacco 
smoke interfere with pulmonary function independent 
of the plethora of toxic chemicals, recovery and repair 
following exposure [148, 149]. In spite of all these 
genome destabilizing and tumor-promoting activities, 
many years of smoking history are generally required 
before a tumor develops, which is a testimony to the 
resilience of the human body to the bombardment of 
such a powerful combination of the carcinogens and 
toxic agents present in tobacco smoke. 

Future Directions 

It would be very difficult if not impossible for life on 
this planet to completely avoid exposure to environ¬ 
mental carcinogens. One can purposely avoid alcohol, 
tobacco smoke, and many occupational exposures, but 
the presence of carcinogenic substances in our food, air, 
water, and sunlight coupled with health, nutritional 
issues, and biological carcinogenic agents make 
avoiding all exposure to carcinogens very difficult. 
The extensive years of educating the public regarding 
the hazards of smoking, along with the banning of 
smoking in public buildings and other establishments 
has had a dramatic effect on the reduction in lung 
cancer and probably other cancers in recent years 
[176, 177]. Increased safety precautions and worker 
awareness to reduce occupational exposures remain 
a very important component in reducing cancer risk. 
Vaccinations against biological carcinogens, such as 
hepatitis B and C, may be important as well, and 
perhaps more important depending upon where one 
travels in the world. As people live longer and healthier 
lives, reductions in carcinogenic contaminants in our 
food, air, and water appears to be the only other viable 
avenue that society as a whole can take to diminish 
cancer risk. Individuals remain responsible for their 
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own nutritional health, exercise, and personal habits in 
avoiding carcinogenic exposures. 

Fortunately, our food also contains anticarcinogens 
[212-214] . For example, the isothiocyanates in broccoli 
are suggested to aid in diminishing the carcinogenic 
impacts of a number of chemical carcinogens [215]. 
This is a relatively young, but growing field of research 
that has the potential to provide viable strategies for 
long-term reductions in cancer risks. Individuals may 
only have to monitor their dietary intake of selected 
vegetables and herbs to enjoy significantly reduced 
cancer risk. However, this may work for reducing 
some types of cancer and not others. 

The relative contribution of endogenous versus 
exogenous causes of human cancer is still a matter of 
some debate. Since the carcinogenic impacts of endog¬ 
enous and exogenous are completely intertwined, sep¬ 
arating out individual contributions is very difficult. 
Only in selective cases where signature diseases and/or 
signature biomarkers such as specific DNA adducts 
and/or mutations are identifiable can the contributions 
be estimated in rough manner [116]. This is a problem 
that is likely to remain for many years to come. Learn¬ 
ing and understanding many of the xenobiotics in our 
food and environment to which humans are presently 
exposed continues to be a prominent field of safety 
research. Add in the large influx of synthetic chemical 
compounds joining our environmental melee each 
year, many of which may contribute to the carcinogenic 
impacts, and identifying all carcinogens in our envi¬ 
ronment becomes a rather daunting undertaking. For 
example, the explosion in designed nanoparticles for 
industrial and medical purposes, introduces potentially 
genotoxic agents into the realm of human exposure. At 
least some synthetic nanomaterials have thus far been 
found to cause chromosomal aberrations, DNA breaks, 
point mutations, and oxidative DNA damage [216]. 
Control of occupational exposures and environmental 
releases of these new genotoxic synthetic compounds 
will be important in opposing additions to the causes 
of environmental carcinogenesis. 

The lessons learned from the classical examples of 
environmental carcinogenesis (sunlight and skin can¬ 
cer, tobacco smoke and lung cancer, and alflatoxin, 
hepatitis and liver cancer) have guided research studies 
and advances in understanding cancer development. 


Continued advances in understanding the tissue, cellu¬ 
lar and molecular processes of carcinogenesis will 
enable the design of strategies to further diminish can¬ 
cer incidence and reduce risks throughout life. 

Bibliography 

1. Ferguson LR, Philpott M (2008) Nutrition and mutagenesis. 
Annu Rev Nutr 28:313-329 

2. Dybing E, O'Brien J, Renwick AG, Sanner T (2008) Risk assess¬ 
ment of dietary exposures to compounds that are genotoxic 
and carcinogenic-an overview. Toxicol Lett 180(2):110-117 

3. Hashimoto K, Higuchi M, Makino B, Sakakibara I, Kubo M, 
Komatsu Y, Maruno M, Okada M (1999) Quantitative analysis 
of aristolochic acids, toxic compounds, contained in some 
medicinal plants. J Ethnopharmacol 64(2):185-189 

4. Nortier JL, Martinez MC, Schmeiser HH, Arlt VM, Bieler CA, 
Petein M, Depierreux MF, De Pauw L, Abramowicz D, 
Vereerstraeten P, Vanherweghem JL (2000) Urothelial carci¬ 
noma associated with the use of a Chinese herb (Aristolochia 
fangchi). N Engl J Med 342(23):1686-1692 

5. Lai MN, Wang SM, Chen PC, Chen YY, Wang JD (2010) Popula¬ 
tion-based case-control study of Chinese herbal products 
containing aristolochic acid and urinary tract cancer risk. 
J Natl Cancer Inst 102(3):179-186 

6. Christman JK, Sheikhnejad G, Dizik M, Abileah S, Wainfan E 
(1993) Reversibility of changes in nucleic acid methylation and 
gene expression induced in rat liver by severe dietary methyl 
deficiency. Carcinogenesis 14(4)551-557 

7. Pufulete M, Al-Ghnaniem R, Rennie JA, Appleby P, Harris N, Gout 
S, Emery PW, Sanders TA (2005) Influence of folate status on 
genomic DNA methylation in colonic mucosa of subjects with¬ 
out colorectal adenoma or cancer. Br J Cancer 92(5):838-842 

8. Pogribny IP, Ross SA, Wise C, Pogribna M, Jones EA, Tryndyak VP, 
James SJ, Dragan YP, Poirier LA (2006) Irreversible global DNA 
hypomethylation as a key step in hepatocarcinogenesis induced 
by dietary methyl deficiency. Mutat Res 593(1-2):80-87 

9. Robinson JK (2005) Sun exposure, sun protection, and vitamin 
D. JAMA 294(12):1541-1543 

10. Rogers HW, Weinstock MA, Harris AR, Hinckley MR, Feldman 
SR, Fleischer AB, Coldiron BM (2010) Incidence estimate of 
nonmelanoma skin cancer in the United States, 2006. Arch 
Dermatol 146(3):283-287 

11. Kao SC, Reid G, Lee K, Vardy J, Clarke S, van Zandwijk N (2010) 
Malignant mesothelioma. Intern Med J 40(11):742-750 

12. Boffetta P, Hashibe M (2006) Alcohol and cancer. Lancet Oncol 
7(2):149—156 

13. Seitz HK, Stickel F (2007) Molecular mechanisms of alcohol- 
mediated carcinogenesis. Nat Rev Cancer 7(8)599-612 

14. Pelucchi C, Gallus S, Garavello W, Bosetti C, La Vecchia C (2008) 
Alcohol and tobacco use, and cancer risk for 
upper aerodigestive tract and liver. Eur J Cancer Prev 17(4): 
340-344 


E 




3614 


E 


Environmental Toxicology: Carcinogenesis 


15. Hecht SS, Kassie F, Hatsukami DK (2009) Chemoprevention of 
lung carcinogenesis in addicted smokers and ex-smokers. Nat 
Rev Cancer 9(7):476-488 

16. McKillop IH, Schrum LW (2009) Role of alcohol in liver carcino¬ 
genesis. Semin Liver Dis 29(2):222-232 

17. He S, Nakada D, Morrison SJ (2009) Mechanisms of stem cell 
self-renewal. Annu Rev Cell Dev Biol 25:377-406 

18. United States Cancer Statistics (USCS) (1999-2006) Cancer 
Incidence and Mortality Data, National Program of Cancer 
Registries (NPCR), Center for Disease Control and Prevention, 
Department of Health and Human Services, USA. http://apps. 
nccd.cdc.gov/uscs/index.aspx. Accessed March 2010 

19. Bielas JH, Loeb LA (2005) Mutator phenotype in cancer: timing 
and perspectives. Environ Mol Mutagen 45(2-3):206-213 

20. Stratton MR, Campbell PJ, Futreal PA (2009) The cancer 
genome. Nature 458(7239):719-724 

21. Ames BN (1989) Endogenous oxidative DNA damage, aging, 
and cancer. Free Radic Res Commun 7(3—6):121—128 

22. Loft S, Poulsen HE (1998) Estimation of oxidative DNA damage 
in man from urinary excretion of repair products. Acta Biochim 
Pol 45(1 ):133-144 

23. Shigenaga MK, Gimeno CJ, Ames BN (1989) Urinary 8-hydroxy- 
2'-deoxyguanosine as a biological marker of in vivo oxidative 
DNA damage. Proc Natl Acad Sci USA 86(24):9697-9701 

24. Cooke MS, Evans MD, Dizdaroglu M, Lunec J (2003) Oxidative 
DNA damage: mechanisms, mutation, and disease. FASEB 
J 17(10):1195-1214 

25. Hussain SP, Harris CC (2007) Inflammation and cancer: an ancient 
link with novel potentials. Int J Cancer 121(11 ):2373—2380 

26. Dahlmann HA, Vaidyanathan VG, Sturla SJ (2009) Investigating 
the biochemical impact of DNA damage with structure-based 
probes: abasic sites, photodimers, alkylation adducts, and 
oxidative lesions. Biochemistry 48(40):9347-9359 

27. Sedelnikova OA, Redon CE, Dickey JS, Nakamura AJ, 
Georgakilas AG, Bonner WM (2010) Role of oxidatively induced 
DNA lesions in human pathogenesis. Mutat Res. 

28. Yuan J, Narayanan L, Rockwell S, Glazer PM (2000) Diminished 
DNA repair and elevated mutagenesis in mammalian cells 
exposed to hypoxia and low pH. Cancer Res 60(16):4372-4376 

29. Camenisch U, Naegeli H (2009) Role of DNA repair in the 
protection against genotoxic stress. EXS 99:111 -150 

30. Goldsby RE, Lawrence NA, Hays LE, Olmsted EA, Chen X, Singh 
M, Preston BD (2001) Defective DNA polymerase-delta proof¬ 
reading causes cancer susceptibility in mice. Nat Med 
7(6):638-639 

31. Loeb LA, Bielas JH, Beckman RA (2008) Cancers exhibit 
a mutator phenotype: clinical implications. Cancer Res 
68(10):3551 -3557, discussion 3557 

32. Zegerman P, Diffley JF (2009) DNA replication as a target of the 
DNA damage checkpoint. DNA Repair (Amst) 8(9):1077-1088 

33. Hanahan D, Weinberg RA (2000) The hallmarks of cancer. Cell 
100(1):57-70 

34. van der Flier LG, Clevers H (2009) Stem cells, self-renewal, and 
differentiation in the intestinal epithelium. Annu Rev Physiol 
71:241-260 


35. Weinberg RA (2008) Mechanisms of malignant progression. 
Carcinogenesis 29(6):1092-1095 

36. Weinberg RA (2008) Coevolution in the tumor microenviron¬ 
ment. Nat Genet 40(5):494-495 

37. Chatten C, Bathe OF (2010) Immunoregulatory cells of the 
tumor microenvironment. Front Biosci 15:291-308 

38. Peto J (2001) Cancer epidemiology in the last century and the 
next decade. Nature 411 (6835)390-395 

39. Loeb LA, Harris CC (2008) Advances in chemical carcinogene¬ 
sis: a historical review and prospective. Cancer Res 
68(17):6863-6872 

40. Jiricny J (2006) The multifaceted mismatch-repair system. Nat 
Rev Mol Cell Biol 7(5)335-346 

41. McCulloch SD, Kunkel TA (2008) The fidelity of DNA synthesis 
by eukaryotic replicative and translesion synthesis polymer¬ 
ases. Cell Res 18(1 ):148-161 

42. Kunkel TA (2009) Evolving views of DNA replication (In)fidelity. 
Cold Spring Harbour Symp Quant Biol 74:91-101 

43. Hussain SP, He P, Subleski J, Hofseth LJ, Trivers GE, Mechanic L, 
Hofseth AB, Bernard M, Schwank J, Nguyen G, Mathe E, 
Djurickovic D, Haines D, Weiss J, Back T, Gruys E, Laubach VE, 
Wiltrout RH, Harris CC (2008) Nitric oxide is a key component 
in inflammation-accelerated tumorigenesis. Cancer Res 
68(17):7130-7136 

44. Fearon ER, Vogelstein B (1990) A genetic model for colorectal 
tumorigenesis. Cell 61 (5):759-767 

45. Blanquet V, Turleau C, Gross-Morand MS, Senamaud-Beaufort 
C, Doz F, Besmond C (1995) Spectrum of germline mutations 
in the RBI gene: a study of 232 patients with hereditary and 
non hereditary retinoblastoma. Hum Mol Genet 4(3)383-388 

46. Hussain SP, Harris CC (1998) Molecular epidemiology of 
human cancer: contribution of mutation spectra studies of 
tumor suppressor genes. Cancer Res 58(18):4023-4037 

47. Wilson VL (2001) Detecting rare mutations associated with 
cancer risk. Am J Pharmacogenomics 1(4):283-293 

48. Tapp RA, Feng J, Jones JW, Carlson JA, Wilson VL (2007) Single 
base instability is promoted in vulvar lichen sclerosus. J Invest 
Dermatol 127(11):2563-2576 

49. Petitjean A, Mathe E, Kato S, Ishioka C, Tavtigian SV, Hainaut P, 
Olivier M (2007) Impact of mutant p53 functional properties 
on TP53 mutation patterns and tumor phenotype: lessons 
from recent developments in the IARC TP53 database. Hum 
Mutat 28(6):622-629 

50. Stenson PD, Mort M, Ball EV, Howells K, Phillips AD, Thomas 
NS, Cooper DN (2009) The Human Gene Mutation Database: 
2008 update. Genome Med 1 (1 ):13 

51. Kumar R, Angelini S, Hemminki K (2003) Activating BRAF and 
N-Ras mutations in sporadic primary melanomas: an inverse 
association with allelic loss on chromosome 9. Oncogene 
22(58):9217-9224 

52. Apian PD (2006) Causes of oncogenic chromosomal translo¬ 
cation. Trends Genet 22(1):46-55 

53. MacCorkle RA, Slattery SD, Nash DR, Brinkley BR (2006) Intracel¬ 
lular protein binding to asbestos induces aneuploidy in human 
lung fibroblasts. Cell Motil Cytoskeleton 63(10):646-657 



Environmental Toxicology: Carcinogenesis 


3615 



54. Pavanello S, Pesatori AC, Dioni L, Hoxha M, Bollati V, Siwinska 
E, Mielzynska D, Bolognesi C, Bertazzi PA, Baccarelli A (2010) 
Shorter telomere length in peripheral blood lymphocytes of 
workers exposed to polycyclic aromatic hydrocarbons. Carci¬ 
nogenesis 31 (2):216—221 

55. Hirao T, Nelson HH, Ashok TD, Wain JC, Mark EJ, Christiani DC, 
Wiencke JK, Kelsey KT (2001) Tobacco smoke-induced DNA 
damage and an early age of smoking initiation induce chro¬ 
mosome loss at 3p21 in lung cancer. Cancer Res 61 (2):612-615 

56. Saletta F, Matullo G, Manuguerra M, Arena S, Bardelli A, Vineis 
P (2007) Exposure to the tobacco smoke constituent 
4-aminobiphenyl induces chromosomal instability in human 
cancer cells. Cancer Res 67(15):7088-7094 

57. Shinmura K, Iwaizumi M, Igarashi H, Nagura K, Yamada H, 
Suzuki M, Fukasawa K, Sugimura FI (2008) Induction of centro- 
some amplification and chromosome instability in p53- 
deficient lung cancer cells exposed to benzo[a]pyrene diol 
epoxide (B[a]PDE). J Pathol 216(3):365-374 

58. Groopman JD, Donahue PR, Zhu JQ, Chen JS, Wogan GN 
(1985) Aflatoxin metabolism in humans: detection of metab¬ 
olites and nucleic acid adducts in urine by affinity chromatog¬ 
raphy. Proc Natl Acad Sci USA 82(19):6492-6496 

59. Groopman JD, Wild CP, Hasler J, Junshi C, Wogan GN, Kensler 
TW (1993) Molecular epidemiology of aflatoxin exposures: 
validation of aflatoxin-N7-guanine levels in urine as 
a biomarker in experimental rat models and humans. Environ 
Health Perspect 99:107-113 

60. Essigmann JM, Wood ML (1993) The relationship between the 
chemical structures and mutagenic specificities of the DNA 
lesions formed by chemical and physical mutagens. Toxicol 
Lett 67(1-3):29-39 

61. Batzer MA, Deininger PL (2002) Alu repeats and human geno¬ 
mic diversity. Nat Rev Genet 3(5):370-379 

62. Konkel MK, Batzer MA (2010) A mobile threat to genome 
stability: The impact of non-LTR retrotransposons upon the 
human genome. Semin Cancer Biol 20(4):211-221 

63. Callinan PA, Batzer MA (2006) Retrotransposable elements and 
human disease. Genome Dyn 1:104— 115 

64. Jones PA, Liang G (2009) Rethinking how DNA methylation 
patterns are maintained. Nat Rev Genet 10(11 ):805—811 

65. Lister R, Pelizzola M, Dowen RH, Hawkins RD, Hon G, Tonti- 
Filippini J, Nery JR, Lee L, Ye Z, Ngo QM, Edsall L, Antosiewicz- 
Bourget J, Stewart R, Ruotti V, Millar AH, Thomson JA, Ren B, 
Ecker JR (2009) Human DNA methylomes at base resolution 
show widespread epigenomic differences. Nature 
462(7271 ):315-322 

66. Schubeler D (2009) Epigenomics: Methylation matters. Nature 
462(7271 ):296-297 

67. Guil S, Esteller M (2009) DNA methylomes, histone codes and 
miRNAs: tying it all together. Int J Biochem Cell Biol 41(1):87-95 

68. Lopez-Serra L, Esteller M (2008) Proteins that bind methylated 
DNA and human cancer: reading the wrong words. Br J Cancer 
98(12):1881-1885 

69. Esteller M (2008) Epigenetics in cancer. N Engl J Med 
358(11):1148-1159 


70. Sharma S, Kelly TK, Jones PA (2010) Epigenetics in cancer. 
Carcinogenesis 31(1 ):27—36 

71. Wilson VL, Jones PA (1983) Inhibition of DNA methylation by 
chemical carcinogens in vitro. Cell 32(1):239-246 

72. Wilson VL, Smith RA, Longoria J, Liotta MA, Harper CM, Harris 
CC (1987) Chemical carcinogen-induced decreases in genomic 
5-methyldeoxycytidine content of normal human bronchial 
epithelial cells. Proc Natl Acad Sci USA 84(10)3298-3301 

73. Chanda S, Dasgupta UB, Guhamazumder D, Gupta M, Chaudhuri 
U, Lahiri S, Das S, Ghosh N, Chatterjee D (2006) DNA hyper- 
methylation of promoter of gene p53 and pi 6 in arsenic-exposed 
people with and without malignancy. Toxicol Sci 89(2):431-437 

74. Bollati V, Baccarelli A, Hou L, Bonzini M, Fustinoni S, Cavallo D, 
Byun HM, Jiang J, Marinelli B, Pesatori AC, Bertazzi PA, Yang AS 
(2007) Changes in DNA methylation patterns in subjects 
exposed to low-dose benzene. Cancer Res 67(3):876-880 

75. Baccarelli A, Bollati V (2009) Epigenetics and environmental 
chemicals. Curr Opin Pediatr 21 (2):243-251 

76. Luger K, Mader AW, Richmond RK, Sargent DF, Richmond TJ 
(1997) Crystal structure of the nucleosome core particle at 2.8 
A resolution. Nature 389(6648):251-260 

77. Malik HS, Henikoff S (2003) Phylogenomics of the nucleosome. 
Nat Struct Biol 10(11):882-891 

78. Allfrey VG, Faulkner R, Mirsky AE (1964) Acetylation and Meth¬ 
ylation of Histones and Their Possible Role in the Regulation of 
Rna Synthesis. Proc Natl Acad Sci USA 51:786-794 

79. Kondo Y, Shen L, Yan PS, Huang TH, Issa JP (2004) Chromatin 
immunoprecipitation microarrays for identification of genes 
silenced by histone H3 lysine 9 methylation. Proc Natl Acad Sci 
USA 101 (19):7398-7403 

80. McKittrick E, Gafken PR, Ahmad K, Henikoff S (2004) Histone 
H3.3 is enriched in covalent modifications associated with 
active chromatin. Proc Natl Acad Sci USA 101 (6):1525-1530 

81. Kondo Y, Shen L, Ahmed S, Boumber Y, Sekido Y, Haddad BR, 
Issa JP (2008) Downregulation of histone H3 lysine 9 
methyltransferase G9a induces centrosome disruption and 
chromosome instability in cancer cells. PLoS ONE 3(4):e2037 

82. Yamada T, Mizuno K, Hirota K, Kon N, Wahls WP, Hartsuiker E, 
Murofushi H, Shibata T, Ohta K (2004) Roles of histone acety¬ 
lation and chromatin remodeling factor in a meiotic recombi¬ 
nation hotspot. EMBO J 23(8):1792-1803 

83. Toh GW, O'Shaughnessy AM, Jimeno S, Dobbie IM, Grenon M, 
Maffini S, O'Rorke A, Lowndes NF (2006) Histone H2A phos¬ 
phorylation and H3 methylation are required for a novel Rad9 
DSB repair function following checkpoint activation. DNA 
Repair (Amst) 5(6):693-703 

84. Shahbazian MD, Zhang K, Grunstein M (2005) Histone H2B 
ubiquitylation controls processive methylation but not 
monomethylation by Dotl and Setl. Mol Cell 19(2):271-277 

85. Iniguez-Lluhi JA (2006) For a healthy histone code, a little 
SUMO in the tail keeps the acetyl away. ACS Chem Biol 
1 (4):204-206 

86. Biel M, Wascholowski V, Giannis A (2005) Epigenetics-an epi¬ 
center of gene regulation: histones and histone-modifying 
enzymes. Angew Chem Int Ed Engl 44(21)3186-3216 


E 




3616 


Environmental Toxicology: Carcinogenesis 



87. Henikoff S (2005) Histone modifications: combinatorial com¬ 
plexity or cumulative simplicity? Proc Natl Acad Sci USA 
102(15):5308-5309 

88. Talasz H, Lindner HH, Sarg B, Helliger W (2005) Histone H4- 
lysine 20 monomethylation is increased in promoter and 
coding regions of active genes and correlates with hyperace¬ 
tylation. J Biol Chem 280(46)38814-38822 

89. Schneider J, Wood A, Lee JS, Schuster R, Dueker J, Maguire C, 
Swanson SK, Florens L, Washburn MP, Shilatifard A (2005) 
Molecular regulation of histone H3 trimethylation by COM¬ 
PASS and the regulation of gene expression. Mol Cell 
19(6):849-856 

90. Khalil AM, Driscoll DJ (2006) Histone H3 lysine 4 
dimethylation is enriched on the inactive sex chromosomes 
in male meiosis but absent on the inactive X in female 
somatic cells. Cytogenet Genome Res 112(1 —2):11—15 

91. Kondo Y, Shen L, Cheng AS, Ahmed S, Boumber Y, Charo C, 
Yamochi T, Urano T, Furukawa K, Kwabi-Addo B, Gold DL, 
Sekido Y, Huang TH, Issa JP (2008) Gene silencing in cancer by 
histone H3 lysine 27 trimethylation independent of promoter 
DNA methylation. Nat Genet 40(6):741-750 

92. Taverna SD, Li H, Ruthenburg AJ, Allis CD, Patel DJ 
(2007) How chromatin-binding modules interpret histone 
modifications: lessons from professional pocket pickers. Nat 
Struct Mol Biol 14(11):1025-1040 

93. Kurdistani SK, Grunstein M (2003) Histone acetylation 
and deacetylation in yeast. Nat Rev Mol Cell Biol 4(4): 
276-284 

94. Roth SY, Denu JM, Allis CD (2001) Histone acetyltransferases. 
Annu Rev Biochem 70:81-120 

95. Brownell JE, Zhou J, Ranalli T, Kobayashi R, Edmondson DG, 
Roth SY, Allis CD (1996) Tetrahymena histone 
acetyltransferase A: a homolog to yeast Gcn5p linking his¬ 
tone acetylation to gene activation. Cell 84(6):843-851 

96. Finnin MS, Donigian JR, Cohen A, Richon VM, Rifkind RA, 
Marks PA, Breslow R, Pavletich NP (1999) Structures of 
a histone deacetylase homologue bound to the TSA and 
SAHA inhibitors. Nature 401 (6749):188-193 

97. Gray SG, Ekstrom TJ (2001) The human histone deacetylase 
family. Exp Cell Res 262(2):75-83 

98. Dokmanovic M, Clarke C, Marks PA (2007) Histone 
deacetylase inhibitors: overview and perspectives. Mol Can¬ 
cer Res 5(10):981 —989 

99. Verdin E, Dequiedt F, Kasler HG (2003) Class II histone 
deacetylases: versatile regulators. Trends Genet 19(5): 
286-293 

100. Sengupta N, Seto E (2004) Regulation of histone deacetylase 
activities. J Cell Biochem 93(1):57-67 

101. Gutierrez RM, Hnilica LS (1967) Tissue specificity of histone 
phosphorylation. Science 157(794):1324-1325 

102. Wei Y, Mizzen CA, Cook RG, Gorovsky MA, Allis CD 
(1998) Phosphorylation of histone H3 at serine 10 is corre¬ 
lated with chromosome condensation during mitosis and 
meiosis in Tetrahymena. Proc Natl Acad Sci USA 
95(13):7480-7484 


103. Hirota T, Lipp JJ, Toh BH, Peters JM (2005) Histone H3 serine 
10 phosphorylation by Aurora B causes HP1 dissociation 
from heterochromatin. Nature 438(7071 ):1176-1180 

104. Goto H, Tomono Y, Ajiro K, Kosako H, Fujita M, Sakurai M, 
Okawa K, Iwamatsu A, Okigaki T, Takahashi T, Inagaki 
M (1999) Identification of a novel phosphorylation site on 
histone H3 coupled with mitotic chromosome condensation. 
J Biol Chem 274(36):25543-25549 

105. Preuss U, Landsberg G, Scheidtmann KH (2003) Novel mitosis- 
specific phosphorylation of histone H3 at Thrl 1 mediated by 
Dlk/ZIP kinase. Nucleic Acids Res 31 (3):878-885 

106. Barratt MJ, Hazzalin CA, Cano E, Mahadevan LC (1994) Mito¬ 
gen-stimulated phosphorylation of histone H3 is targeted to 
a small hyperacetylation-sensitive fraction. Proc Natl Acad Sci 
USA 91(11 ):4781 -4785 

107. Cheung P, Tanner KG, Cheung WL, Sassone-Corsi P, Denu JM, 
Allis CD (2000) Synergistic coupling of histone H3 phosphor¬ 
ylation and acetylation in response to epidermal growth 
factor stimulation. Mol Cell 5(6):905-915 

108. Clayton AL, Rose S, Barratt MJ, Mahadevan LC (2000) 
Phosphoacetylation of histone H3 on c-fos- and c-jun- 
associated nucleosomes upon gene activation. EMBO 
J 19(14)3714-3726 

109. Panier S, Durocher D (2009) Regulatory ubiquitylation in 
response to DNA double-strand breaks. DNA Repair (Amst) 
8(4):436-443 

110. Galanty Y, Belotserkovskaya R, Coates J, Polo S, Miller KM, 
Jackson SP (2009) Mammalian SUMO E3-ligases PIAS1 and 
PIAS4 promote responses to DNA double-strand breaks. 
Nature 462(7275):935-939 

111. Morris JR, Boutell C, Keppler M, Densham R, Weekes D, 
Alamshah A, Butler L, Galanty Y, Pangon L, Kiuchi T, Ng T, 
Solomon E (2009) The SUMO modification pathway is 
involved in the BRCA1 response to genotoxic stress. Nature 
462(7275):886-890 

112. An JY, Kim EA, Jiang Y, Zakrzewska A, Kim DE, Lee MJ, Mook- 
Jung I, Zhang Y, Kwon YT (2010) UBR2 mediates transcrip¬ 
tional silencing during spermatogenesis via histone 
ubiquitination. Proc Natl Acad Sci USA 107(5):1912—1917 

113. Grazini U, Zanardi F, Citterio E, Casola S, Goding CR, McBlane 
F (2010) The RING domain of RAG1 ubiquitylates histone H3: 
a novel activity in chromatin-mediated regulation of V(D)J 
joining. Mol Cell 37(2):282-293 

114. Lu LY, Wu J, Ye L, Gavrilina GB, SaundersTL, Yu X (2010) RNF8- 
dependent histone modifications regulate nucleosome 
removal during spermatogenesis. Dev Cell 18(3)371-384 

115. Mabruk MJ, Toh LK, Murphy M, Leader M, Kay E, Murphy GM 
(2009) Investigation of the effect of UV irradiation on DNA 
damage: comparison between skin cancer patients and nor¬ 
mal volunteers. J Cutan Pathol 36(7):760-765 

116. Wogan GN, Hecht SS, Felton JS, Conney AH, Loeb LA 
(2004) Environmental and chemical carcinogenesis. Semin 
Cancer Biol 14(6):473-486 

117. Lee WR, Perantie DC, Clark KB, Guillot DA, Wilson VL 
(2001) Effect of mutagen-induced cell lethality on the dose 



Environmental Toxicology: Carcinogenesis 


E 


3617 


response of germline mutations. Environ Mol Mutagen 
37(4):340-344 

118. Ellison KS, Dogliotti E, Connors TD, Basu AK, Essigmann JM 
(1989) Site-specific mutagenesis by 06-alkylguanines located 
in the chromosomes of mammalian cells: influence of the 
mammalian 06-alkylguanine-DNA alkyltransferase. Proc Natl 
Acad Sci USA 86(22):8620-8624 

119. Dosanjh MK, Singer B, Essigmann JM (1991) Comparative 
mutagenesis of 06-methylguanine and 04-methylthymine 
in Escherichia coli. Biochemistry 30(28):7027-7033 

120. Potten CS, Booth C (2002) Keratinocyte stem cells: 
a commentary. J Invest Dermatol 119(4):888-899 

121. Bapat SA (2007) Evolution of cancer stem cells. Semin Cancer 
Biol 17(3):204-213 

122. Orford KW, Scadden DT (2008) Deconstructing stem cell self¬ 
renewal: genetic insights into cell-cycle regulation. Nat Rev 
Genet 9(2):115-128 

123. Yahata T, Muguruma Y, Yumino S, Sheng Y, Uno T, 
Matsuzawa H, Ito M, Kato S, Hotta T, Ando K (2008) Quiescent 
human hematopoietic stem cells in the bone marrow niches 
organize the hierarchical structure of hematopoiesis. Stem 
Cells 26(12)3228-3236 

124. Karpowicz P, Morshead C, Kam A, Jervis E, Ramunas J, Cheng 
V, van der Kooy D (2005) Support for the immortal strand 
hypothesis: neural stem cells partition DNA asymmetrically in 
vitro. J Cell Biol 170(5):721-732 

125. LechlerT, Fuchs E (2005) Asymmetric cell divisions promote 
stratification and differentiation of mammalian skin. Nature 
437(7056):275-280 

126. Smith GH (2005) Label-retaining epithelial cells in mouse 
mammary gland divide asymmetrically and retain their tem¬ 
plate DNA strands. Development 132(4):681-687 

127. Cairns J (1975) Mutation selection and the natural history of 
cancer. Nature 255(5505):197-200 

128. Potten CS, Grant HK (1998) The relationship between ionizing 
radiation-induced apoptosis and stem cells in the small and 
large intestine. Br J Cancer 78(8):993-1003 

129. Lee JM, Dedhar S, Kalluri R, Thompson EW (2006) The epithe¬ 
lial-mesenchymal transition: new insights in signaling, devel¬ 
opment, and disease. J Cell Biol 172(7):973—981 

130. Rubin H (2001) The role of selection in progressive neoplastic 
transformation. Adv Cancer Res 83:159-207 

131. Negrini S, Gorgoulis VG, Halazonetis TD (2010) Genomic 
instability-an evolving hallmark of cancer. Nat Rev Mol Cell 
Biol 11 (3):220-228 

132. Beckman RA, Loeb LA (2006) Efficiency of carcinogenesis with 
and without a mutator mutation. Proc Natl Acad Sci USA 
103(38):14140-14145 

133. Chu WK, Hickson ID (2009) RecQ helicases: multifunctional 
genome caretakers. Nat Rev Cancer 9(9):644-654 

134. Lengauer C, Kinzler KW, Vogelstein B (1998) Genetic instabil¬ 
ities in human cancers. Nature 396(6712):643-649 

135. Ganem NJ, Godinho SA, Pellman D (2009) A mechanism 
linking extra centrosomes to chromosomal instability. Nature 
460(7252):278-282 


136. Weaver BA, Silk AD, Montagna C, Verdier-Pinard P, Cleveland 
DW (2007) Aneuploidy acts both oncogenically and as 
a tumor suppressor. Cancer Cell 11 (1 ):25-36 

137. Thompson SL, Compton DA (2008) Examining the link 
between chromosomal instability and aneuploidy in human 
cells. J Cell Biol 180(4):665-672 

138. Torres EM, Williams BR, Amon A (2008) Aneuploidy: cells 
losing their balance. Genetics 179(2):737-746 

139. Hussein MR, Wood GS (2002) Microsatellite instability and its 
relevance to cutaneous tumorigenesis. J Cutan Pathol 
29(5):257-267 

140. Dietmaier W, Wallinger S, Bocker T, Kullmann F, Fishel R, 
Ruschoff J (1997) Diagnostic microsatellite instability: defini¬ 
tion and correlation with mismatch repair protein expression. 
Cancer Res 57(21 ):4749-4756 

141. Boland CR, Thibodeau SN, Hamilton SR, Sidransky D, 
Eshleman JR, Burt RW, Meltzer SJ, Rodriguez-Bigas MA, 
Fodde R, Ranzani GN, Srivastava S (1998) A National Cancer 
Institute Workshop on Microsatellite Instability for cancer 
detection and familial predisposition: development 
of international criteria for the determination of microsatel¬ 
lite instability in colorectal cancer. Cancer Res 58(22): 
5248-5257 

142. Hoeijmakers JH (2001) Genome maintenance mechanisms 
for preventing cancer. Nature 411 (6835):366-374 

143. Jascur T, Boland CR (2006) Structure and function of the 
components of the human DNA mismatch repair system. 
Int J Cancer 119(9):2030-2035 

144. Thibodeau SN, French AJ, Roche PC, Cunningham JM, Tester 
DJ, Lindor NM, Moslein G, Baker SM, Liskay RM, Burgart LJ, 
Honchel R, Hailing KC (1996) Altered expression of hMSH2 
and hMLHI in tumors with microsatellite instability and 
genetic alterations in mismatch repair genes. Cancer Res 
56(21 ):4836-4840 

145. Hussein MR, Wood GS (2002) Microsatellite instability in 
human melanocytic skin tumors: an incidental finding or 
a pathogenetic mechanism? J Cutan Pathol 29(1 ):1 —4 

146. Fishel R, Lescoe MK, Rao MR, Copeland NG, Jenkins NA, 
Garber J, Kane M, Kolodner R (1993) The human mutator 
gene homolog MSH2 and its association with hereditary 
nonpolyposis colon cancer. Cell 75(5):1027-1038 

147. Leach FS, Nicolaides NC, Papadopoulos N, Liu B, Jen J, Par¬ 
sons R, Peltomaki P, Sistonen P, Aaltonen LA, Nystrom-Lahti 
M et al (1993) Mutations of a mutS homolog in hereditary 
nonpolyposis colorectal cancer. Cell 75(6):1215-1225 

148. Garcon G, Dagher Z, Zerimech F, Ledoux F, Courcot D, 
Aboukais A, Puskaric E, Shirali P (2006) Dunkerque City air 
pollution particulate matter-induced cytotoxicity, oxidative 
stress and inflammation in human epithelial lung cells (LI 32) 
in culture. Toxicol In Vitro 20(4):519-528 

149. Saint-Georges F, Garcon G, Escande F, Abbas I, Verdin A, Gosset 
P, Mulliez P, Shirali P (2009) Role of air pollution Particulate 
Matter (PM(2.5)) in the occurrence of loss of heterozygosity in 
multiple critical regions of 3p chromosome in human epithelial 
lung cells (LI 32). Toxicol Lett 187(3):172-179 


E 




3618 


Environmental Toxicology: Carcinogenesis 



150. Hirose T, Kondo K, Takahashi Y, Ishikura H, Fujino H, Tsuyuguchi 
M, Hashimoto M, Yokose T, Mukai K, Kodama T, Monden 
Y (2002) Frequent microsatellite instability in lung cancer from 
chromate-exposed workers. Mol Carcinog 33(3):172—180 

151. ZienolddinyS, Aguelon AM, Mironov N, Mathew B,Thomas G, 
Sankaranarayanan R, Yamasaki FI (2004) Genomic instability 
in oral squamous cell carcinoma: relationship to betel-quid 
chewing. Oral Oncol 40(3):298-303 

152. Parsons BL, Marchant-Miros KE, Delongchamp RR, Verkler TL, 
Patterson TA, McKinzie PB, Kim LT (2010) ACB-PCR quantifica¬ 
tion of K-RAS codon 12 GAT and GTT mutant fraction in colon 
tumor and non-tumor tissue. Cancer Invest 28(4):364-375 

153. Toyota M, Ahuja N, Ohe-Toyota M, Herman JG, Baylin SB, Issa 
JP (1999) CpG island methylator phenotype in colorectal 
cancer. Proc Natl Acad Sci USA 96(15):8681-8686 

154. Teodoridis JM, Hardie C, Brown R (2008) CpG island methyl¬ 
ator phenotype (CIMP) in cancer: causes and implications. 
Cancer Lett 268(2):177-186 

155. Tanemura A, Terando AM, Sim MS, van Hoesel AQ, de Maat 
MF, Morton DL, Hoon DS (2009) CpG island methylator phe¬ 
notype predicts progression of malignant melanoma. Clin 
Cancer Res 15(5):1801-1807 

156. Yagi K, Akagi K, Hayashi H, Nagae G, Tsuji S, Isagawa T, 
Midorikawa Y, Nishimura Y, Sakamoto H, Seto Y, Aburatani 
H, Kaneda A (2010) Three DNA methylation epigenotypes in 
human colorectal cancer. Clin Cancer Res 16(1 ):21 —33 

157. Gaudet F, Hodgson JG, Eden A, Jackson-Grusby L, Dausman J, 
Gray JW, Leonhardt H, Jaenisch R (2003) Induction of tumors 
in mice by genomic hypomethylation. Science 
300(5618):489-492 

158. Gama-Sosa MA, Slagel VA, Trewyn RW, Oxenhandler R, Kuo 
KC, Gehrke CW, Ehrlich M (1983) The 5-methylcytosine con¬ 
tent of DNA from human tumors. Nucleic Acids Res 
11 (19):6883-6894 

159. Goelz SE, Vogelstein B, Hamilton SR, Feinberg AP 
(1985) Hypomethylation of DNA from benign and malignant 
human colon neoplasms. Science 228(4696):187-190 

160. Kim Yl, Giuliano A, Hatch KD, Schneider A, Nour MA, Dallal GE, 
Selhub J, Mason JB (1994) Global DNA hypomethylation 
increases progressively in cervical dysplasia and carcinoma. 
Cancer 74(3):893-899 

161. Shahrzad S, Bertrand K, Minhas K, Coomber BL (2007) Induc¬ 
tion of DNA hypomethylation by tumor hypoxia. Epigenetics 
2(2):119-125 

162. Avila MA, Carretero MV, Rodriguez EN, Mato JM (1998) Regu¬ 
lation by hypoxia of methionine adenosyltransferase activity 
and gene expression in rat hepatocytes. Gastroenterology 
114(2)364-371 

163. Yanagawa N, Tamura G, Honda T, Endoh M, Nishizuka S, 
Motoyama T (2004) Demethylation of the synuclein gamma 
gene CpG island in primary gastric cancers and gastric cancer 
cell lines. Clin Cancer Res 10(7):2447-2451 

164. Laner T, Schulz WA, Engers R, Muller M, Florl AR 
(2005) Hypomethylation of the XIST gene promoter in pros¬ 
tate cancer. Oncol Res 15(5):257-264 


165. Fraga MF, Ballestar E, Paz MF, Ropero S, Setien F, Ballestar ML, 
Heine-Suner D, Cigudosa JC, Urioste M, Benitez J, Boix- 
Chornet M, Sanchez-Aguilera A, Ling C, Carlsson E, Poulsen 
P, Vaag A, Stephan Z, Spector TD, Wu YZ, Plass C, Esteller 
M (2005) Epigenetic differences arise during the lifetime 
of monozygotic twins. Proc Natl Acad Sci USA 102(30): 
10604-10609 

166. Kampranis SC, Tsichlis PN (2009) Histone demethylases and 
cancer. Adv Cancer Res 102:103-169 

167. Kristensen LS, Nielsen HM, Hansen LL (2009) Epigenetics and 
cancer treatment. Eur J Pharmacol 625(1-3):131-142 

168. Kinzler KW, Vogelstein B (1997) Cancer-susceptibility genes. 
Gatekeepers and caretakers. Nature 386(6627):761-763 

169. Bardelli A, Cahill DP, Lederer G, Speicher MR, Kinzler KW, 
Vogelstein B, Lengauer C (2001) Carcinogen-specific induc¬ 
tion of genetic instability. Proc Natl Acad Sci USA 
98(10)3770-5775 

170. Palmero El, Achatz Ml, Ashton-Prolla P, Olivier M, Hainaut 
P (2010) Tumor protein 53 mutations and inherited cancer: 
beyond Li-Fraumeni syndrome. Curr Opin Oncol 22(1):64-69 

171. Amberger J, Bocchini CA, Scott AF, Hamosh A (2009) 
McKusick's Online Mendelian Inheritance in Man (OMIM). 
Nucleic Acids Res. 37(Database issue): D793-D796 

172. Cazier JB, Tomlinson I (2010) General lessons from large-scale 
studies to identify human cancer predisposition genes. 
J Pathol 220(2):255-262 

173. Gresner P, Gromadzinska J, Wasowicz W (2007) Polymorphism 
of selected enzymes involved in detoxification and biotransfor¬ 
mation in relation to lung cancer. Lung Cancer 57(1 ):1 —25 

174. Cote ML, Wenzlaff AS, Bock CH, Land SJ, Santer SK, Schwartz 
DR, Schwartz AG (2007) Combinations of cytochrome P-450 
genotypes and risk of early-onset lung cancer in Caucasians 
and African Americans: a population-based study. Lung Can¬ 
cer 55(3):255-262 

175. Schwartz AG, Prysak GM, Bock CH, Cote ML (2007) The molecular 
epidemiology of lung cancer. Carcinogenesis 28(3)307-518 

176. Cokkinides V, Bandi P, McMahon C, Jemal A, Glynn T, Ward 
E (2009) Tobacco control in the United States-recent pro¬ 
gress and opportunities. CA Cancer J Clin 59(6)352-365 

177. Lubin JH, Alavanja MC, Caporaso N, Brown LM, Brownson RC, 
Field RW, Garcia-Closas M, Hartge P, Hauptmann M, Hayes RB, 
Kleinerman R, Kogevinas M, Krewski D, Langholz B, 
Letourneau EG, Lynch CF, Malats N, Sandler DP, Schaffrath- 
Rosario A, Schoenberg JB, Silverman DT, Wang Z, Wichmann 
HE, Wilcox HB, Zielinski JM (2007) Cigarette smoking and 
cancer risk: modeling total exposure and intensity. Am 
J Epidemiol 166(4):479-489 

178. Robles Al, Linke SP, Harris CC (2002) The p53 network in lung 
carcinogenesis. Oncogene 21(45):6898-6907 

179. Olivier M, Hollstein M, Hainaut P (2010) TP53 Mutations in 
Human Cancers: Origins, Consequences, and Clinical Use. 
Cold Spring Harb Perspect Biol 2(1):a001008 

180. Denissenko MF, Pao A, Tang M, Pfeifer GP (1996) Preferential 
formation of benzo[a]pyrene adducts at lung cancer muta¬ 
tional hotspots in P53. Science 274(5286):430-432 



Environmental Toxicology: Carcinogenesis 


E 


3619 


181. Greenblatt MS, Bennett WP, Hollstein M, Harris CC 
(1994) Mutations in the p53 tumor suppressor gene: clues 
to cancer etiology and molecular pathogenesis. Cancer Res 
54(18):4855-4878 

182. Bennett WP, Hussain SP, Vahakangas KH, Khan MA, Shields 
PG, Harris CC (1999) Molecular epidemiology of human can¬ 
cer risk: gene-environment interactions and p53 mutation 
spectrum in human lung cancer. J Pathol 187(1 ):8—18 

183. Hsu 1C, Metcalf RA, Sun T, Welsh JA, Wang NJ, Harris CC 
(1991) Mutational hotspot in the p53 gene in human hepa¬ 
tocellular carcinomas. Nature 350(6317):427-428 

184. Soini Y, Chia SC, Bennett WP, Groopman JD, Wang JS, 
DeBenedetti VM, Cawley H, Welsh JA, Hansen C, Bergasa 
NV, Jones EA, DiBisceglie AM, Trivers GE, Sandoval CA, 
Calderon IE, Munoz Espinosa LE, Harris CC (1996) An afla- 
toxin-associated mutational hotspot at codon 249 in the p53 
tumor suppressor gene occurs in hepatocellular carcinomas 
from Mexico. Carcinogenesis 17(5):1007—1012 

185. Gouas D, Shi H, Hainaut P (2009) The aflatoxin-induced TP53 
mutation at codon 249 (R249S): biomarker of exposure, 
early detection and target for therapy. Cancer Lett 286(1): 
29-37 

186. Brash DE, Rudolph JA, Simon JA, Lin A, McKenna GJ, Baden 
HP, Halperin AJ, Ponten J (1991) A role for sunlight in skin 
cancer: UV-induced p53 mutations in squamous cell carci¬ 
noma. Proc Natl Acad Sci USA 88(22):10124-10128 

187. Armstrong BK, Kricker A (2001) The epidemiology of UV 
induced skin cancer. J Photochem Photobiol B 63(1 —3):8—18 

188. RungerTM (2007) How different wavelengths of the ultraviolet 
spectrum contribute to skin carcinogenesis: the role of cellular 
damage responses. J Invest Dermatol 127(9):2103-2105 

189. Fang JL, Vaca CE (1997) Detection of DNA adducts of acetal¬ 
dehyde in peripheral white blood cells of alcohol abusers. 
Carcinogenesis 18(4):627-632 

190. Vaca CE, Nilsson JA, Fang JL, Grafstrom RC (1998) Formation 
of DNA adducts in human buccal epithelial cells exposed to 
acetaldehyde and methylglyoxal in vitro. Chem Biol Interact 
108(3):197-208 

191. Clemens DL, Forman A, Jerrells TR, Sorrell MF, Tuma DJ 
(2002) Relationship between acetaldehyde levels and cell 
survival in ethanol-metabolizing hepatoma cells. Hepatology 
35(5):1196-1204 

192. Molina PE, Hoek JB, Nelson S, Guidot DM, Lang CH, Wands JR, 
Crawford JM (2003) Mechanisms of alcohol-induced tissue 
injury. Alcohol Clin Exp Res 27(3):563-575 

193. el Ghissassi F, Barbin A, Nair J, Bartsch H (1995) Formation of 
1,N6-ethenoadenine and 3,N4-ethenocytosine by lipid per¬ 
oxidation products and nucleic acid bases. Chem Res Toxicol 
8(2):278-283 

194. Giovannucci E (2004) Alcohol, one-carbon metabolism, and 
colorectal cancer: recent insights from molecular studies. 
J Nutr 134(9):2475S-2481S 

195. Giovannucci E (2002) Epidemiologic studies of folate and 
colorectal neoplasia: a review. J Nutr 132(8 Suppl): 
2350S-2355S 


196. Leiter U, Garbe C (2008) Epidemiology of melanoma and 
nonmelanoma skin cancer-the role of sunlight. Adv Exp 
Med Biol 624:89-103 

197. Ridley AJ, Whiteside JR, McMillan TJ, Allinson SL (2009) Cellu¬ 
lar and sub-cellular responses to UVA in relation to carcino¬ 
genesis. Int J Radiat Biol 85(3):177-195 

198. Matsunaga T, Hieda K, Nikaido O (1991) Wavelength depen¬ 
dent formation of thymine dimers and (6-4) photoproducts 
in DNA by monochromatic ultraviolet light ranging from 
150 to 365 nm. Photochem Photobiol 54(3):403-410 

199. Lacour JP (2002) Carcinogenesis of basal cell carcinomas: 
genetics and molecular mechanisms. Br J Dermatol 
146(Suppl 61 ):17-19 

200. Agar NS, Halliday GM, Barnetson RS, Ananthaswamy HN, 
Wheeler M, Jones AM (2004) The basal layer in human squa¬ 
mous tumors harbors more UVA than UVB fingerprint muta¬ 
tions: a role for UVA in human skin carcinogenesis. Proc Natl 
Acad Sci USA 101(14):4954-4959 

201. Kuhn C, Hurwitz SA, Kumar MG, Cotton J, Spandau DF 
(1999) Activation of the insulin-like growth factor-1 receptor 
promotes the survival of human keratinocytes following 
ultraviolet B irradiation. Int J Cancer 80(3):431-438 

202. Lewis DA, Spandau DF (2007) UVB activation of NF-kappaB in 
normal human keratinocytes occurs via a unique mechanism. 
Arch Dermatol Res 299(2):93-101 

203. Lewis DA, Yi Q, Travers JB, Spandau DF (2008) UVB-induced 
senescence in human keratinocytes requires a functional 
insulin-like growth factor-1 receptor and p53. Mol Biol Cell 
19(4):1346-1353 

204. Tavakkol A, Elder JT, Griffiths CE, Cooper KD, Talwar H, Fisher 
GJ, Keane KM, Foltin SK, Voorhees JJ (1992) Expression of 
growth hormone receptor, insulin-like growth factor 1 (IGF-1) 
and IGF-1 receptor mRNA and proteins in human skin. 
J Invest Dermatol 99(3):343-349 

205. Ferber A, Chang C, Sell C, Ptasznik A, Cristofalo VJ, Hubbard K, 
Ozer HL, Adamo M, Roberts CT Jr, LeRoith D et al (1993) 
Failure of senescent human fibroblasts to express the 
insulin-like growth factor-1 gene. J Biol Chem 268: 
17883-17888 

206. Agents Classified by the IARC Monographs 1-100, Updated 
30 August 2010; IARC Monographs on the Evaluation of 
Carcinogenic Risks to Humans, http://monographs.iarc.fr/ 
ENG/Classification/index.php. Accessed November 2010 

207. Hecht SS, Carmella SG, Murphy SE, Akerkar S, Brunnemann 
KD, Hoffmann D (1993) A tobacco-specific lung carcinogen in 
the urine of men exposed to cigarette smoke. N Engl J Med 
329(21 ):1543-1546 

208. Smoking and Tobacco Control Monograph 10, Health Effects 
of Exposure to Environmental Tobacco Smoke (August 1999), 
National Cancer Institute, National Institutes of Health, 
Bethesda, MD 

209. Alexandrov K, Rojas M, Kadlubar FF, Lang NP, Bartsch 
H (1996) Evidence of anti-benzo[a]pyrene diolepoxide-DNA 
adduct formation in human colon mucosa. Carcinogenesis 
17(9):2081-2083 


E 




3620 


E 


Environmental Toxicology: Children at Risk 


210. Hecht SS, Hoffmann D (1988) Tobacco-specific nitrosamines, 
an important group of carcinogens in tobacco and tobacco 
smoke. Carcinogenesis 9(6):875-884 

211. Pleasance ED, Stephens PJ, O'Meara S, McBride DJ, Meynert 
A, Jones D, Lin ML, Beare D, Lau KW, Greenman C, Varela I, 
Nik-Zainal S, Davies HR, Ordonez GR, Mudie LJ, Latimer C, 
Edkins S, Stebbings L, Chen L, Jia M, Leroy C, Marshall J, 
Menzies A, Butler A, Teague JW, Mangion J, Sun YA, 
McLaughlin SF, Peckham HE, Tsung EF, Costa GL, Lee CC, 
Minna JD, Gazdar A, Birney E, Rhodes MD, McKernan KJ, 
Stratton MR, Futreal PA, Campbell PJ (2010) A small-cell 
lung cancer genome with complex signatures of tobacco 
exposure. Nature 463(7278):184-190 

212. McCullough ML, Giovannucci EL (2004) Diet and cancer pre¬ 
vention. Oncogene 23(38):6349-6364 

213. Kensler TW, Groopman JD (2009) Is it time to advance the 
chemoprevention of environmental carcinogenesis with 
microdosing trials? Cancer Prev Res (Phila Pa) 2(12):1003—1007 

214. Ju J, Picinich SC, Yang Z, Zhao Y, Suh N, Kong AN, Yang CS 
(2010) Cancer-preventive activities of tocopherols and 
tocotrienols. Carcinogenesis 31(4):533-542 

215. Fowke JH (2007) Head and neck cancer: a case for inhibition 
by isothiocyanates and indoles from cruciferous vegetables. 
Eur J Cancer Prev 16(4):348-356 

216. Singh N, Manshian B, Jenkins GJ, Griffiths SM, Williams PM, 
Maffeis TG, Wright CJ, Doak SH (2009) NanoGenotoxicology: 
the DNA damaging potential of engineered nanomaterials. 
Biomaterials 30(23-24)3891-3914 


Environmental Toxicology: Children 
at Risk 

Maria J. Carroquino 1 , M. Posada 1 , P. J. Landrigan 2 

^nstituto de Salud Carlos III, Madrid, Spain 
department of Preventive Medicine, Mount Sinai 
School of Medicine, New York, NY, USA 


Article Outline 

Glossary 

Definition of the Subject 

Introduction: Brief History of the Issue of Childrens 
Vulnerability to Environmental Hazards 
Toxicological Basis of Childrens Vulnerability to 
Environmental Hazards: Susceptibility and 
Exposure Factors Affecting Children s Vulnerability 
Major Groups of Pollutants to Which Children are 
Exposed 


Environmental Threats to Children on Specific Organ 
Systems 

Future Directions 

Bibliography 

Glossary 

Adult The time of life usually starting at 18 years 
(some systems such as skeleton and brain may 
continue to develop). 

Adverse effect A treatment-related alteration from 
baseline that diminishes an organism’s ability to 
survive, reproduce, or adapt to the environment. 

Carcinogen Any substance that can cause cancer. 

Critical period A specific phase during which 
a developing system is particularly susceptible. 

Developmental disorders/effects Adverse effects such 
as altered growth, structural abnormality, functional 
deficiency, or death observed in a developing 
organism. 

Dose (exposure)-response relationship Characteriza¬ 
tion of the relationship between administered dose 
or exposure and the biological change in organisms. 

Embryonic period The period from fertilization to 
the end of major organogenesis. 

Environmental exposures and environmental 
hazards For this entry, these terms refer to specific 
environmental chemicals and environmental 
pollutants. 

Exposure Contact with a chemical by swallowing, by 
breathing, or by direct contact (such as through the 
skin or eyes). Exposure may be short term (acute) 
or long term (chronic). 

Fetus The period from 8 weeks of pregnancy to birth. 

Gestation Length of time between conception and 
birth. 

Infant The period from 28 days of age to 1 year. 

Lowest observed adverse effect level (LOAEL) The 
lowest concentration of a chemical in a study, or 
group of studies, that produces statistically or 
biologically significant increase in the frequency 
or severity of adverse effects between the exposed 
population and an appropriate control. 

Mechanism of action The detailed molecular knowl¬ 
edge of the key events leading to an adverse effect in 
an organism. 

Neonate The period from birth to 28 days of age. 
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Perinatal stage The period of 29 weeks of pregnancy 
to 7 days after birth. 

Pregnancy The condition of having an implanted 
embryo or fetus in the body, after fusion of an 
ovum and spermatozoon. 

Preterm birth A birth occurring at 24-37 weeks of 
pregnancy. 

Risk assessment An empirically based paradigm that 
estimates the risk of adverse effect(s) from exposure 
of an individual or population to a chemical, phys¬ 
ical, or biological agent. It includes the components 
of hazard identification, assessment of dose- 
response relationships, exposure assessment, and 
risk characterization. 

Route of exposure Exposure route refers to the differ¬ 
ent ways a substance may enter the body. The route 
may be dermal, ingestion, or inhalation. 

Sexual maturation Achievement of full development 
of the reproductive system and sexual function. 
Susceptibility An individual’s intrinsic or acquired 
traits that modify the risk of illness (e.g., high sus¬ 
ceptibility to cancer). 

Toxicokinetics The process of the uptake of poten¬ 
tially toxic substances by the body, the biotransfor¬ 
mation they undergo, the distribution of the 
substances and their metabolites in the tissues, 
and the elimination of the substances and their 
metabolites from the body. Both the amounts and 
the concentrations of the substances and their 
metabolites are studied. (Pharmacokinetics is the 
term used to study pharmaceutical substances.) 
Vulnerability A matrix of physical, chemical, biologi¬ 
cal, social, and cultural factors that result in certain 
communities and subpopulations being more 
susceptible to environmental factors because of 
greater exposure to such factors or a compromised 
ability to cope with and/or recover from such 
exposure. 

Definition of the Subject 

Children today live in a world that is vastly different 
from a few generations ago. While industrialization has 
maximized (for many) children’s opportunities to sur¬ 
vive, develop and enjoy high levels of health, education, 
recreation, and fulfillment, it has also added significant 
challenges to their development. 


In the countries in the Organization for Economic 
Co-operation and Development (OECD), an infant 
born today has about 20 years longer life expectancy 
than one born at the beginning of the twentieth century 
[1]. Infant mortality has gone down by over 90%. 
Despite AIDS, SARS, West Nile virus, and the constant 
threat of other emerging infections, the ancient epi¬ 
demics of smallpox, yellow fever, cholera, bubonic 
plague, polio, and measles are no longer the dominant 
causes of disease and death. However, an increase in the 
incidence (new cases) of many chronic diseases in chil¬ 
dren has been observed. These include asthma, cancer 
(which is the second leading cause of death in children 
after injuries), birth defects, developmental disabilities, 
and autism. For some of these conditions, an environ¬ 
mental origin has been established, and for others it is 
hypothesized. 

Environmental threats to children’s health range 
from asthma-inducing air pollution and lead-based 
paint in older homes, to treatment-resistant microbes 
in drinking water and persistent industrial chemicals 
that may cause cancer or result in reproductive or 
developmental changes. 

Patterns of illness among children in the industri¬ 
ally developed nations have changed substantially. 
Infant mortality has declined. Life expectancy has 
increased. However, an increase in the incidence (new 
cases) of some diseases, such as asthma, birth defects, 
neurodevelopmental disorders and certain types of 
childhood and adolescence cancers, and obesity has 
been observed. Evidence is accumulating that toxic 
chemicals are responsible for at least some of these 
changing patterns of disease. 

Children are uniquely susceptible to chemicals. 
This great vulnerability reflects the juxtaposition of 
two phenomena early in life: first, that infants and 
children have disproportionately greater exposures 
than adults to environmental chemicals, and second, 
that children are exquisitely sensitive to these expo¬ 
sures, because they are poorly equipped to metabolize 
many toxic compounds and because they are 
progressing through the complex, delicate, and easily 
disrupted stages of early development. The protection 
of children from environmental health hazards requires 
the consideration of their exposure patterns and sus¬ 
ceptibility factors when conducting risk assessments, 
development of child protective legislation, and wider 
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application of the Precautionary Principle in the face of 
early warning of danger. 

Introduction: Brief History of the Issue of 
Children's Vulnerability to Environmental 
Hazards 

Children s risks from environmental health threats have 
received considerable political attention during the last 
two decades. In 1989, the United Nations Convention on 
the Rights of the Child laid down basic standards for the 
protection of children and proclaimed that they 
are entitled to special care and assistance. A year later, 
the World Summit for Children (WSC) adopted a Dec¬ 
laration on the Survival, Protection and Development 
for Children, in which the signatories agreed to join 
efforts on taking measures to protect the environment, 
so that all children can enjoy a safer and healthier future. 

Agenda 21, adopted in 1992 at the United Nations 
Conference on Environment and Development 
(“the Earth Summit”) gives attention to the protection 
of children from the effects of a deteriorating environ¬ 
ment in several chapters. Chapter 6 of Agenda 21 
“Protecting and Promoting Human Health” empha¬ 
sizes the need to pay special attention to protecting 
vulnerable groups, particularly infants, young people, 
women, indigenous people, and the poor. Agenda 21 
urges governments to develop programs to protect 
children from the effects of environmental and occu¬ 
pational toxic compounds. 

In the USA, the National Research Council report 
[2] “Pesticides in the Diet of Infants and Children” was 
critical in raising awareness about the importance in 
risk assessment of childrens environmental health. 
This report elevated concern on a broad national level 
about childrens special vulnerabilities to environmen¬ 
tal agents. It made clear that protection of the health of 
vulnerable populations would require a new approach 
to risk assessment. The NRC report recommended an 
approach to risk assessment that moved beyond con¬ 
sideration of average exposures based primarily on 
adult characteristics to one that accounted for the het¬ 
erogeneity of exposures and for potential differential 
sensitivities of various life stages, particularly during 
prenatal development, infancy, and childhood. 

Responding to recommendations in the NRC 
report [2], the US Government took decisive steps to 


attend to the growing concern on children susceptibil¬ 
ity to environmental toxicants. In 1995, the EPA issued 
a National Policy to consistently and explicitly take into 
account health risks to children and infants from envi¬ 
ronmental hazards when conducting assessments of 
environmental risks. In 1997, the Clinton Administra¬ 
tion issued an Executive Order, Executive Order 13045: 
Protection of Children from Environmental Health 
Risks and Safety Risks. The Executive Order requires 
all federal agencies to address health and safety risks to 
children, to coordinate research priorities on childrens 
health, and to ensure that their standards take into 
account special risks to children. To implement the 
order, the US EPA established the Office of Childrens 
Health Protection (OCHP) (renamed the Office of 
Childrens Health Protection and Environmental Edu¬ 
cation - OCHPEE - in 2005), whose job is to work with 
Program and Regional Offices within the US EPA to 
promote a safe and healthy environment for children 
by ensuring that all regulations, standards, policies, and 
risk assessments take into account risks to children. 
Legislation, such as the Food Quality Protection Act 
and the Safe Drinking Water Act amendments, has 
made coverage of children s health issues more explicit, 
and research on children s health issues is continually 
expanding. As a result of the emphasis on childrens 
risk, the US EPA Office of Research and Development 
(ORD) developed a strategy for research on environ¬ 
mental risks to children. The goal of this research 
agenda is to discover the environmental causes of dis¬ 
ease in children and then to convert these research 
findings into blueprints for disease prevention and 
health promotion. 

The 1997 Declaration of the Environmental 
Leaders of the Eight on Children’s Environmental 
Health intensified their commitment to protecting 
children’s health from environmental hazards. The 
Environment Ministers of the G8 countries acknowl¬ 
edged the special vulnerabilities of children and com¬ 
mitted their countries to taking action on several 
specific environmental health issues such as lead, 
microbiologically safe drinking water, endocrine 
disrupting chemicals, environmental tobacco smoke 
(ETS), and air quality. They called on financial insti¬ 
tutions, the World Health Organization (WHO), the 
United Nations Environment Program (UNEP), and 
other international bodies to continue ongoing 
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activities and to pay further attention to childrens 
environmental health, in particular the economic 
and social dimensions of childrens health. In addi¬ 
tion, they committed their countries to fulfilling and 
to promoting the Organization for Economic Co¬ 
operation and Development (OECD) Declaration on 
Risk Reduction for Lead. The underlying rationale for 
each of these actions is that disease of environmental 
origin in children can be prevented by controlling 
harmful exposures in the environment. 

International organizations such as the World 
Health Organization (WHO), the United Nations 
Environment Program (UNEP), and the European 
Union (EU) responded to this call and also developed 
initiatives on Children, Environment and Health. Chil- 
dren’s Health and Environment has been the central 
focus of Europe wide Ministerial Conferences orga¬ 
nized by WHO, and other EU initiatives, such as 
SCALE (Science, Children, Awareness, Legal Instru¬ 
ment, Evaluation). Thus, the protection of children 
from environmental hazards has been a subject of 
intense political and scientific attention and effort dur¬ 
ing the last two decades. Since the initial reports on this 
issue, research has intensified, and scientific literature 
abounds with new findings on childrens differential 
vulnerability to environmental health threats that 
have provided the basis to maintaining the political 
attention on children s health. 

Toxicological Basis of Children's Vulnerability to 
Environmental Hazards: Susceptibility and 
Exposure Factors Affecting Children's 
Vulnerability 

Over the past 2 decades, a number of scientific reports 
have integrated the knowledge available on the suscep¬ 
tibility of children to environmental toxicants, have 
highlighted knowledge gaps, and have pointed to 
future directions of research to fill these gaps. This 
has expanded and deepened the knowledge of the dif¬ 
ferential susceptibility of children to environmental 
toxicants, and at the same time has allowed the firm 
establishment of certain fundamentals concerning chil¬ 
dren’s susceptibility to environmental toxicants. 

The International Life Science Institute (ILSI) and 
the EPA, recognizing the need for examining the scien¬ 
tific evidence on the broad question of the differential 


susceptibility of children to environmental hazards, 
organized in 1990 a conference titled: “Similarities 
and Differences Between Children and Adults: Impli¬ 
cations for Risk Assessment.” The results from this 
conference were summarized in the publication “Sim¬ 
ilarities and Differences Between Children and Adults” 
[3] and was incorporated into ongoing work of the 
National Research Committee on Pesticides in the 
Diets of Infants and Children. The National Academy 
of Science (NAS) published the NRC report “Pesticides 
in the Diets of Infants and Children” [2], which was 
critical in raising awareness of the importance of con¬ 
sidering the vulnerable life stages of children when 
conducting risk assessment of exposures to children. 
In 2001, the International Life Sciences Institute con¬ 
vened a number of scientific experts to develop 
a conceptual framework for conducting health risk 
assessments of children exposures, which takes into 
consideration their unique characteristics and special 
vulnerabilities [4]. 

The ILSI report highlighted the fact that “children 
are not little adults,” but a unique subpopulation that 
needs to be considered in risk assessment due to differ¬ 
ential exposure patterns, immaturity in physiological 
development, or differential toxicant metabolism. 

In 2006, the WHO published the Principles for 
Evaluating the Health Risks in Children Associated 
with Exposure to Chemicals in the Environmental 
Health Criteria 237 [5]. This publication constitutes 
the most recent monographic work of the subject of 
children susceptibility to environmental toxicants, and 
provides comprehensive information on childrens 
developmental stages and the critical windows of sus¬ 
ceptibility that appear through the course of their 
development. 

In summary, it is clear from these three major scien¬ 
tific reports - NAS, ILSI, and WHO - that there is broad 
scientific consensus that risks to children from environ¬ 
mental health threats differ qualitatively or quantita¬ 
tively from those of adults for a variety of reasons: 

• Differential exposure patterns: Compared to adults, 
children have heavier exposures in relation to body 
weight because they drink more water, eat 
more food, and have higher breathing rates per 
unit of body weight than adults do. As a conse¬ 
quence, children have substantially heavier 
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exposures than adults to any toxicants that are 
present in water, food, or air. 

• Childrens ability to metabolize, detoxify, and 
excrete chemicals is different from that of adults. 
During the first months after birth, their metabolic 
pathways are immature. In some cases, children 
may actually have a higher metabolic capacity for 
some toxicants than adults. Commonly, however, 
they are less able to deal with toxic chemicals and 
thus are more vulnerable to them. 

• Children undergo rapid growth and development, 
and their development phases are perfectly sched¬ 
uled to achieve complete functional development. If 
a development phase is disturbed at a given time, its 
time to take place may be lost definitively. Thus, 
interferences with certain phases of development 
may have irreversible effects. If cells in a child’s 
developing brain are destroyed by chemicals such 
as lead, mercury, or solvents, or if false signals are 
sent to the developing reproductive organs by endo¬ 
crine disruptors, there is a high risk that the resulting 
dysfunction will be permanent and irreversible. 

• Children’s exposures are affected qualitatively and 
quantitatively by their behavior and the unique 
microenvironment in which they spend their time. 
Their hand-to-mouth behavior brings contami¬ 
nated items or soil to their mouth, and living and 
playing closer to the ground also exposes them to 
pollutants in a different pattern than adults, both in 
quantitative and qualitative terms. 

• Because children generally have more years of life 
ahead than adults, they have more time to develop 
chronic diseases triggered by early exposures. Many 
diseases, such as cancer and neurodegenerative 
diseases, are thought to arise through a series of 
stages that require years or even decades from ini¬ 
tiation to actual manifestation of disease. Carcino¬ 
genic and toxic exposures, sustained early in life, 
including prenatal exposures, would then be more 
likely to lead to disease than similar exposures 
encountered later. 

The purpose of this chapter has been to select, 
integrate, and summarize the most relevant and illus¬ 
trative information that explains and highlights the 
potential susceptibility of children to environmental 
exposures. The goal of the chapter is to promote 


understanding of the issue of differential susceptibility 
and exposures of children to environmental threats 
among scientists from adjacent fields, and to encourage 
scientists to consider the issue of children’s differential 
susceptibility and exposures in their research. Recogni¬ 
tion of children’s exquisite vulnerability to toxic expo¬ 
sures in the environment is critical to child-protective 
risk assessment and to disease prevention. 

For a more in-depth study of any of the particular 
health issues or particular toxicants, extensive and 
rigorous reviews that are cited through this document 
can be consulted. The following sections will elaborate 
on the general toxicological aspects of children’s 
susceptibility. 

Exposure 

There are several factors that influence the differential 
exposures of children to environmental toxicants. 
These are: (1) their higher ingestion, drinking, and 
breathing rates per unit of body weight, (2) their 
unique behaviors, (3) where they spend their time, 
and (4) their unique microenvironments. 

Ingestion, Breathing, and Drinking Rates In toxi¬ 
cology, exposure is defined as the contact that occurs 
between a receptor (a living organism) and an environ¬ 
mental agent. The most commonly considered expo¬ 
sure pathways are inhalation, ingestion, and dermal 
absorption. For infants in the womb, transplacental 
transfer of toxic chemicals is another unique route of 
exposure, and for nursing infants, the proportion of the 
toxicant that enters the body through each of these 
pathways will depend on both external, environmental 
factors, and on biological factors that will determine 
the rate of uptake of the chemical by the organism. 
Exposure can also be referred to as the concentration 
that a target organ receives, once the chemical has been 
absorbed. 

The environmental concentration and duration of 
exposure will determine the extent of exposure. In 
addition, other environmental factors such as for 
example, temperature, humidity, and pH, may alter 
pollutant concentrations. For example, the concentra¬ 
tion of some volatile chemicals may vary under differ¬ 
ent temperature conditions. Sometimes, chemical 
reactions between pollutants in the environment may 
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change qualitatively the nature of exposure, such as in 
the case of ozone formation in the air from nitrogen 
oxides (NOx), carbon monoxide (CO), and volatile 
organic compounds (VOCs) in the presence of 
sunlight. 

The patterns of exposure to ingested and inhaled 
toxicants are different between children and adults. 
Young children drink more water on a body weight 
basis than adults do (seven times as much water per 
kilogram of body weight). For the first 6 months, for 
example, children consume on average 88 mL/kg/day 
of tap water directly or indirectly (water added in the 
preparation of formula or fruit juice) compared to 
adults who drink 17 mL/kg/day [6]. As a result of 
this greater consumption of water, children may be 
disproportionately exposed to chemicals found in 
drinking water, including the water used to make infant 
formula [2]. 

Infants and young children have a higher resting 
metabolic rate and rate of oxygen consumption per 
unit body weight than adults. The oxygen consumption 
of a resting infant aged between 1 week and 1 year is 
7 mL/kg body weight per minute, compared to that of 
an adult in the same condition, which is 3-5 L/kg/min 
[5]. Thus, on a body weight basis, the volume of air 
passing through the lungs of a resting infant is twice 
that of an adult, and therefore twice as much of any 
chemical in the atmosphere would pass through the 
lungs of an infant. An additional consideration is the 
smaller lung surface area per kilogram in the early 
stages of development. Thus, the higher amount of 
inspired air will affect a relatively smaller area of lung 
tissue. In addition, children have narrower airways 
than those of adults. Thus, irritation caused by air 
pollution that would produce only a slight response 
in an adult, can result in potentially significant obstruc¬ 
tion in the airways of a young child [6]. Furthermore, 
the fact that children spend more time engaged in 
vigorous activities than adults exacerbates the differen¬ 
tial effects. Children also ingest more food per unit of 
body weight than do adults. A 1-5 year old child eats 
three to four times more food per kilogram than the 
average adult, resulting in larger amounts of chemicals 
and infectious agents per unit of body mass [2, 7]. 
A child’s diet is very different from an adult’s. The 
diet of children contains more milk products and 
more fruits and vegetables per unit of body weight 


than adults. During the first year of life, human milk 
or cow milk-based products constitute most of their 
energy and nutrient source. The NRC report [2] 
reported that over the first year, cow milk products 
comprise 36% and 58% of the diets of nursing and 
non-nursing infants, respectively, compared to adults, 
where milk and milk products constitute only about 
29% of their diet. Environmental pollutants, such as 
PCBs and dioxins, accumulate in fat and have been 
found in breast milk, although it is broadly accepted 
that the benefits of breast-feeding still outweigh the 
risks of exposure [5]. 

After the first few months of life, fruits and fruit 
juices constitute a large proportion of infants’ diet. For 
example, the average consumption of apples for children 
between birth and 5 months of age is almost ten times 
that of an adult older than 20 years old [6] . Fruits can be 
contaminated with pesticides and other toxicants, 
causing children to be exposed to these chemicals. 

Behavioral Patterns Influencing Exposure in 
Children Two characteristic behaviors of children 
have been considered and studied in relation to chil¬ 
dren’s exposure to chemicals: mouthing behavior and 
pica behavior. Infants and toddlers pass through a 
developmental phase, characterized by intense oral 
exploratory behavior, when they indiscriminately 
bring their hands and objects to their mouths. 
Mouthing behavior can result in oral exposures to 
chemicals that may be part of the surfaces of objects, 
or adhered to them in dust particles. 

Pica is a behavior of some children that has been 
defined as the craving or ingestion of nonfood items. 
The cravings found in patients with pica have been 
associated with a nutritional deficiency state, such as 
iron-deficient anemia; with pregnancy or with mental 
illness. Pica tends to disappear as children grow older, 
except for mentally retarded children [8] . Pica behavior 
can give rise to the ingestion of soil, paint, and other 
possibly contaminated substances. About 95% of 
children ingest 0.2 g of soil per day or less, but studies 
have shown that children with pica behavior ingest up 
to 60 g of soil per day [8]. Children living in neighbor¬ 
hoods built on soil contaminated with heavy metals 
(such as lead), aromatic hydrocarbons, and other com¬ 
pounds were found to be at increased risk for elevated 
blood levels [8]. 
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Children may also have higher exposures because of 
their higher levels of physical activity, at least during 
certain stages of their development. When they are 
outdoors, they are more prone to run, jump, or play 
vigorously than adults do. Their breathing rates may 
reach exercise levels more frequently during a day than 
in adults. When near water, for example, children tend 
to spend more time in the water than adults. This can 
lead to higher exposures of air pollutants, such as 
ozone, or to toxicants in water due to the inhalation 
of volatilized chemicals, ingestion of pool water, and 
dermal exposures. 

Other behaviors characteristic of children such 
as overexposure to sunlight for sun tanning among 
adolescents, or drug and alcohol ingestion, can 
also lead to disproportionately higher risks. However, 
in this chapter, the central focus is on those risks that 
are completely involuntary, and that would be difficult 
to change without altering the normal behavior of 
a child. 

The Unique Microenvironment of Children The 

childrens microenvironment is different from that of 
adults from the very beginning of conception. The 
womb, where the fetus is exposed to environmental 
chemicals and other agents from previous exposures 
of the mother is a unique environment to this devel¬ 
opmental stage. In fact, humans are among the living 
mammals that have the longest gestational period, 
resulting in a long in-utero exposure. Many pollutants 
are known to cross the placenta and enter the fetal 
circulation, resulting in fetal exposure to toxicants 
present in the mother. Once the baby is born, breast¬ 
feeding can also result in exposures to chemicals 
accumulated in breast milk. 

Children spend a large proportion of their time 
indoors, and because of their smaller stature and 
because they crawl and play on the floor, they are 
more highly exposed to ground-level contaminants. 
In indoor environments, for example, formaldehyde 
exposure from carpeting would be higher at ground 
level than about 4 or 6 ft above, the breathing zone of 
adults. House dust or particles to which pollutants 
adhere accumulate near the floor both in the indoor 
and outdoor air. Further, outdoors where there is traf¬ 
fic, their breathing zone is close to the height at which 
car exhaust systems expel their fumes. 


Indoor air pollutants can originate indoors, such as 
formaldehyde from carpeting, fumes from home 
cooking, and solvents from freshly painted rooms, or 
can enter the inside of homes through windows or 
open airways. Sometimes, pollutants may be carried 
inside by persons entering the indoor environment. 
For example, children whose parents work on farms 
where pesticides are applied can be exposed to pesti¬ 
cides carried inside in the clothes and skin of adults 
who walk into the house. Studies have found higher 
levels of pesticides in farmworker households 
in agricultural areas compared with those in non¬ 
farmworker households in the same area [9]. In addi¬ 
tion, some studies have demonstrated that degradation 
of pesticides is slower in the indoor environment, so 
pesticide exposure persists longer [9]. 

Toxicokinetics Once exposed, the amount of toxi¬ 
cant being absorbed, its metabolism, and excretion 
from childrens bodies may be very different from that 
of adults. Rates of absorption through the oral, dermal, 
and respiratory pathways vary between children and 
adults. For example, children have a larger lung surface 
area per kilogram of body weight than adults and, 
under normal breathing, breathe 50% more air per 
kilogram of body weight than adults [6]. Therefore 
infants potentially receive a greater internal exposure 
to airborne compounds on a body weight basis. 

Childrens absorption of chemicals through the gas¬ 
trointestinal route is also different from that of adults. 
For example, the absorption of lead from the intestine 
was found to be 40 times higher for children than for 
adults [3]. Gastric pH is higher in newborns than in 
adults, thus causing differences in ionization and 
absorption of certain chemicals. The alkaline gastric 
pH in newborns and infants may enhance the absorp¬ 
tion of certain compounds. 

The skin surface area of children relative to body 
weight is greater for children than for adults, resulting 
in a higher potential dose absorbed through the skin of 
about three times greater for infants than for adults 
[10]. In addition, hydration of the skin in neonates is 
greater than in older children, which potentially could 
result in them having a greater absorption of some 
hydrophilic chemicals. The permeability of the epider¬ 
mal is incomplete in the preterm infant, resulting in 
greater percutaneous absorption of chemical agents [5] . 
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Once absorbed, the different body composition of 
children will result in a different distribution and accu¬ 
mulation of chemicals compared to adults. For exam¬ 
ple, the relatively larger extracellular fluid volume of 
the infant means somewhat greater dilution of water- 
soluble chemicals. However, the lip id-soluble sub¬ 
stances would be distributed in a smaller volume of 
fat in infants relative to adults. The body composition 
in terms of water, fat, and protein changes from birth to 
adulthood resulting in corresponding changes in the 
bioaccumulation, metabolism, and excretion of con¬ 
taminants from the body throughout development. 
Water content, for example, decreases rapidly during 
the first 6 months of life and then remains fairly con¬ 
stant. Body fat, on the contrary, increases rapidly up to 
6 months and then decreases, accounting for similar 
percentages of body weight at ages 4 and 12 months [5] . 
This can result in changes in the concentration of 
chemicals in different tissues and organs that can over¬ 
ride the child’s ability to metabolize and excrete the 
chemical. 

Metabolism and elimination rates are generally lower 
in neonates than in adults. Many metabolic pathways are 
not fully developed in the infant. Renal clearance is lower 
in neonates than in older children and adults for all 
classes of chemicals. These factors will lead to differences 
in the bioaccumulation and persistence of environmen¬ 
tal agents in children’s and adult’s bodies and the con¬ 
centration of the agents in specific organs. The 
maturation of these metabolic and elimination rates 
can result in the mobilization of chemicals, and higher 
exposures of the target organs or systems to the products 
of metabolism. In terms of excretion, certain life stages 
such as pregnancy, lactation, and also menopause, will 
result in the mobilization of chemicals from fat or bone 
stores, and the exposure of other organs or the develop¬ 
ing fetus and lactating child to the toxicants that were 
accumulated in the mother. 

Developmental Aspects of Children’s Susceptibility: 
Critical Periods of Development 

Effects During Gestation From conception to birth, 
the human organism advances rapidly through a 
complex set of developmental processes that culminate 
in the newborn. These development processes include 
cell division, organ formation, and growth as well and 


functional development. During development, some 
biological processes occur only during certain stages 
of development and not in others, or they occur at 
a different rate in different developmental stages. For 
example, cell division in most organs takes place rap¬ 
idly during early development and much more slowly 
at later stages. Other processes such as apoptosis, or 
programmed cell death, occur more widely during 
development and are less prominent during adulthood. 

These biological processes need to be effectively 
coordinated and require the cellular and intercellular 
signaling systems to work correctly. Because of the 
complexity and speed at which these processes take 
place and the intricate relation between them, interfer¬ 
ence at the sequence of any of these processes can have 
damaging and irreversible effects. Exposure to environ¬ 
mental toxicants can have a completely different effect 
depending on whether it occurs at one developmental 
stage or another. In addition, damage due to environ¬ 
mental exposures may occur and manifest itself imme¬ 
diately, or may not appear until subsequent stages of 
development, after development is complete, including 
late adulthood. 

Identifying and understanding these “critical” 
periods of unique susceptibility is essential to develop¬ 
ing strategies to protect children from adverse health 
effects associated with environmental exposures. 
The following sections describe the current state of 
knowledge about the windows of susceptibility during 
childhood development. 

Effects on Germ Cells Germ cells (sperm and egg 
cells) carry the genetic information from each parent 
that will provide the unique genetic blueprint for each 
child. In the male fetus, primordial germ cells develop 
in utero. From puberty to adulthood, these cells 
undergo cell division, mitosis, and meiosis, to produce 
mature sperm and continue to be produced from stem 
cells through adulthood. In females, primordial germ 
cells undergo mitosis and the first phase of meiosis 
during fetal life, and in women, mature oocytes are 
produced every month from follicular cells. 

Environmental toxicants that harm germ cells can 
affect an adult’s own fertility as well as the health of the 
offspring. In animal models, preconceptional carcino¬ 
genesis has been demonstrated for a variety of types of 
radiation and chemicals, with demonstrated sensitivity 
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for all stages from fetal gonocytes to postmeiotic germ 
cells [11]. Although this link in humans is not demon¬ 
strated, it is theoretically possible that environmental 
toxicants that harm germ cells can affect an adult’s own 
fertility as well as the health of the offspring. 

Results of environmental damage to germ cells may 
include reduced fertility later in life or offspring with 
congenital health problems [12, 13]. For example, men 
exposed to diethylstilbestrol (DES) in utero had 
lowered sperm count and increased frequency of 
abnormal sperm [14]. Women exposed to cigarette 
smoke during their mother’s pregnancy had reduced 
fertility [13]. There is a substantial body of evidence 
demonstrating that exposures to environmental agents 
and medical radiation can injure germ cells in such a way 
as to cause increased incidence of cancer, particularly 
leukemia, among offspring of the exposed individuals. 
For example, paternal exposures to benzene have been 
linked to leukemia and lymphoma in children [15]. 
Animal studies support these findings [11]. 

Embryonic and Fetal Development During 
Pregnancy Several stages of embryonic and fetal 
development are susceptible to environmental harm. 
During development, gene expression is very active 
because a large number of genes are being “switched 
on” or “switched off” to control cellular activities. This 
high level of metabolic activity provides for a wide 
range of opportunities for environmental agents to 
interfere with cell development and growth. 

Environmental toxicants may interact directly with 
DNA (e.g., alkylating agents) and disturb gene expres¬ 
sion or may interact with the products of gene 
expression, such as enzymes and control molecules. 
A toxicant that interferes with gene expression may 
prevent the synthesis of enzymes necessary for toxicant 
metabolism, resulting in the accumulation of the tox¬ 
icant in the body. DNA activation by a chemical may 
result in excessive synthesis of enzymes that catalyze the 
bioactivation of a toxicant (production of a toxicant 
metabolite more toxic than the compound originally 
present). 

Chemicals can interfere with the activation or inac¬ 
tivation of genes that occur during early fetal life and 
that may be essential for the protection of the organism 
to external or internal harmful processes. For example, 
interference with genes involved in DNA repair, such as 


p53, a tumor suppressor gene important for DNA 
repair, may result in enhanced vulnerability to specific 
toxicants during development. Studies with transgenic 
mice that are missing this gene have shown an 
increased sensitivity of mice fetuses to benzo[ a] pyrene 
exposure and an increased death rate when exposed to 
the chemical during gestation [16]. Damage to p53 in 
humans could likewise increase sensitivity to agents 
that damage genetic material. 

If a toxicant interferes with cell differentiation, cells 
may not reach their specific form and function neces¬ 
sary for their final role in the body, and organ function 
may be compromised. Also, undifferentiated cells may 
be more vulnerable than differentiated cells to toxic 
effects. Some chemicals such as ethanol [17] and 
TCDD [18] have been demonstrated to affect specific 
types of undifferentiated cells. 

Apoptosis or programmed cell death is a critical 
biological process for healthy development. Apoptosis 
involves the removal of certain cell types when they are 
no longer necessary. In some instances, one type of cell 
is succeeded by another during a specific developmen¬ 
tal period. Apoptosis is involved, for example, in the 
elimination of cells in the immune system that, if they 
survived, could cause autoimmune disease [19]. 
Apoptosis is also critical in the development of the 
nervous system, where phases of cell proliferation 
alternate with phases of apoptosis on the basis of the 
progression of neuronal development [20] and remains 
active through the postnatal period because of ongoing 
nervous system development. 

Normal patterns of apoptosis may be altered 
through altered gene expression or failure of signaling 
mechanisms resulting from environmental exposures. 
Certain autoimmune lympho-proliferative diseases 
and certain cancers have been related to the disruption 
of normal patterns of apoptosis. For example, Wilms’ 
tumor, a relatively common childhood cancer, may 
arise from the transformation by postnatal exposures 
of renal stem cells that fail to disappear 4-6 weeks prior 
to birth [21]. 

Neuronal migration is an important process in 
nervous system development and its alteration 
may result in irreversible damage. For example, 
schizophrenia is thought to result, in part, from 
abnormal neuronal migration [22], but the role of 
prenatal exposures to environmental agents in causing 
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this disease is not clear. Exposures to ionizing radiation 
and methylmercury, for example, have been shown to 
affect the migration of neurons during development 
[20, 23]. 

During the period of organ development, which 
occurs (varying according to organ system) between 
the 3rd and 16th week, disruption of development 
can disrupt the large-scale structure of organs, often 
resulting in physical malformations (congenital anom¬ 
alies). The best known example of such gestational 
damage is exposure to diethylstilbestrol (DES). DES 
caused genital anomalies among male children born of 
women who took the medication before the 11th week 
of gestation twice as often as among those who were 
exposed later in gestation [14]. 

Other effects such as low birth weight, pregnancy 
complications, or late fetal death have been shown to be 
a result of environmental exposures during later stages 
of prenatal development [24]. Disinfection by¬ 
products have been linked to the risk of spontaneous 
abortion for some time. There is now fairly consistent 
evidence for associations between early and late fetal 
deaths and indices of transplacental exposure to 
disinfection by-products [25-27]. Maternal smoking 
during pregnancy increases the risk of pregnancy loss, 
stillbirth, and infant mortality [28]. 

Development During Childhood Several organs and 
systems continue to grow and develop during 
childhood and in some cases almost until adulthood. 
For example, neuron migration, cell proliferation, and 
synapse formation are very active until 3 years of age, 
and myelination continues until adolescence [29] and 
possibly well into adulthood [20]. 

The immune response is also immature at birth 
and develops during infancy and childhood until 
about 1 year of age, while establishment of immuno¬ 
logic memory is not fully established until 18 years of 
age [30]. Exposure to environmental agents during 
early childhood may affect immune system develop¬ 
ment and may contribute to the development of certain 
diseases such as asthma and cancer later in life. 

Physical growth and maturation of organ systems 
continues through adolescence. The process of sexual 
maturation is accompanied by complex interactions 
between the central nervous system and hormone- 
secreting organs, which can be affected by 


environmental exposures. For example, the risk of 
breast cancer has been found to be greater among 
women who were exposed to radiation before 20 
years of age [31]. 

Cellular Metabolism and Biotransformation Many 
important metabolic and biotransformation processes 
are poorly developed in the fetus, and full metabolic 
activity is not fully developed until after childbirth. 
Metabolism can increase or decrease the toxicity of 
a chemical, depending on the metabolic products of 
the chemical and pathway involved. Metabolism may 
also make elimination from the body easier or harder, 
although the most common metabolic pathways 
usually render chemicals more hydrophilic and thus, 
more easily excreted. In some cases, the adult biotrans¬ 
formation of a certain chemical may consist of 
a bioactivation pathway that makes the compound 
more hazardous than the one originally present. The 
absence of a metabolic pathway may result in the 
bioaccumulation of the chemical in the body and 
a later bioavailability and disposition to exert its toxic 
effects. Immaturity could be an advantage if the acti¬ 
vation pathway is not present in the fetus or child and 
there is an alternate pathway for the toxicant to be 
metabolized. However, according to [32], given the 
primary evolutionary function of detoxifying and elim¬ 
inating potentially toxic chemicals, immature or 
underdeveloped metabolic pathways are likely to ren¬ 
der infants and children more sensitive to common 
environmental contaminants. 

Major Groups of Pollutants to Which Children 
are Exposed 

Heavy Metals 

Heavy metals are natural elements that have been 
extracted from the earth and used in human industry 
and products for centuries. As a consequence of human 
activity, concentrations of heavy metals in air, water, 
and surface soil today are hundreds of times higher 
than in the preindustrial era. Some metals are naturally 
found in the body and are essential to the functioning 
of critical enzyme systems. Iron, for example, prevents 
anemia, and zinc is a cofactor in over 100 enzyme 
reactions. Magnesium and copper are other familiar 
metals that, in minute amounts, are necessary for 
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proper metabolism to occur. The body has need for 
approximately 70 trace elements, but there are others, 
such as lead, mercury, aluminum, arsenic, cadmium, 
and nickel, that have no roles in human physiology and 
can be toxic at even trace levels of exposure. Nutrition¬ 
ally, heavy metals can compete with nutrient elements, 
such as the case of lead, which is stored in the bones in 
the place of calcium. 

Metals are notable for their wide environmental 
dispersion, their tendency to accumulate in select tis¬ 
sues, and their overall potential to be toxic at even 
relatively minor levels of exposure. In general, heavy 
metals are systemic toxins with specific neurotoxic , 
nephrotoxic , fetotoxic, and teratogenic effects. Heavy 
metals can directly influence behavior by impairing 
mental and neurological function, influencing neuro¬ 
transmitter production and utilization, and altering 
numerous metabolic body processes. 

Exposure to heavy metals can occur through 
drinking water, air, or ingestion of heavy metal- 
contaminated soil. The amount that is actually 
absorbed from the digestive tract can vary widely, 
depending on the chemical form of the metal and the 
age and nutritional status of the individual. Once 
a metal is absorbed, it distributes in tissues and 
organs. Excretion of metals typically occurs through 
the kidneys and digestive tract, but they tend to persist 
in some storage sites, like the liver, bones, and kidneys, 
for years or decades. 

Lead is one of the best known heavy metals in 
terms of its toxicity. Exposure to lead can occur in 
the prenatal stage through the placenta, and in infants 
through the mother’s milk and the water used in 
milk formula [3]. During pregnancy, body stores of 
lead may be mobilized and transferred from the 
mother to the fetus [33]. Behavioral characteristics of 
children later on, such as the hand-to-mouth behav¬ 
ioral pattern of 1-3 year olds, can result in high expo¬ 
sure and internal levels of lead. Lead paint is a major 
source of environmental exposure for children who 
ingest flaking paint, paint chips, and weathered pow¬ 
dered paint (mostly from deteriorated housing units 
in urban areas). Lead can leach into drinking water 
from lead-based solder used in water pipes. Lead also 
leaches into foods or liquids stored in ceramic con¬ 
tainers made with lead glazing, which is still used in 
some countries. 


The absorption of lead from ingestion of lead- 
contaminated water is higher for children than for 
adults, so that for a given level of exposure, the resul¬ 
tant internal dose is higher in children than in adults 
[3]. Children are also more sensitive than adults to the 
toxicological effects of lead at a given internal exposure 
level. The lowest observed adverse effect levels 
(LOAELs) for several health endpoints occur at lower 
blood lead levels in children than in adults. The most 
sensitive targets for lead toxicity are the developing 
nervous system, the hematological and cardiovascular 
systems, and the kidney. There appear to be no safe 
exposure “thresholds” for lead or for other metals in 
early development. 

Mercury is a ubiquitous heavy metal of both natural 
and anthropogenic sources. Mercury occurs in both 
inorganic and organic forms, and it is most hazardous 
in its organic form of methylmercury. The nervous 
system is very sensitive to all forms of mercury. Meth¬ 
ylmercury and metallic mercury vapors are more 
harmful than other forms, because methylmercury 
can cross the blood brain barrier. Methylmercury in 
the marine and freshwater environment is absorbed by 
fish and shellfish and bioaccumulates in the food chain. 
Increased risk is of particular concern in children and 
in populations that have an increased dietary exposure 
to fish. 

Arsenic occurs naturally in the environment and in 
some areas of the world is a natural contaminant of 
underground water that is used as drinking water. It is 
also an anthropogenic contaminant. Once absorbed 
into the body, arsenic undergoes some accumulation 
in soft tissue organs such as the liver, spleen, kidneys, 
and lungs, but the major long-term storage site 
for arsenic is keratin-rich tissues, such as skin, hair, 
and nails. 

Cadmium is another chemical that is toxic to 
adults, although it has not been extensively studied in 
children. The US Department of Health and Human 
Services has determined that cadmium and cadmium 
compounds are known human carcinogens. Cadmium 
has been linked to diminished kidney function lung 
disease, chronic bronchitis, and lung, kidney, and pros¬ 
tate cancers. In the USA, smoking is the primary source 
of cadmium exposure, although high levels of 
cadmium can also be found in organ meats, shellfish, 
and vegetables. 
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Pesticides 

Pesticides are substances that are used to prevent, repel, 
or destroy pests - organisms that compete for food 
supply, adversely affect comfort, or endanger human 
health (FIFRA 1996). More than 20,000 pesticide 
products with nearly 900 active ingredients are regis¬ 
tered for use as insecticides, miticides, fumigants, 
wood preservatives, and plant growth regulators. It 
cannot be denied that pesticides have beneficial eco¬ 
nomic and also public health impacts. Pesticide usage 
helps improve human nutrition through greater avail¬ 
ability, longer storage life, and lower costs of food. It 
also reduces human labor requirements and attendant 
risks of injury. Pesticides also assist in the control of 
food-borne and vector-borne diseases, such as malaria, 
which kill millions of persons in the world. Pesticides 
also pose human health concerns because they are toxic 
substances and widely spread in the environment. 
Although the toxic mechanisms on targeted pest spe¬ 
cies are well characterized, the potential for adverse 
health effects in humans is not fully known. 

Pesticides are composed of several classes of 
chemicals with different mechanisms of action. Most 
insecticides work by interfering with nervous system 
function. Organophosphates, which account for approx¬ 
imately one-half of the insecticides used in the USA, and 
carbamates, which are widely used in homes and gar¬ 
dens, inhibit the activity of acetyl cholinesterase at nerve 
endings, resulting in an excess of acetylcholine in the 
synapsis and a depolarizing blockage of neural transmis¬ 
sion. The effects of carbamates are readily reversible and 
of shorter duration. Organochlorines, such as dichloro- 
diphenyltrichloroethane (DDT) and lindane, interfere 
with nerve cell membrane cation transport, resulting in 
neural irritability and excitation of the central nervous 
system. Herbicides, including the chlorophenoxy com¬ 
pounds 2,4 D and 2,4,5 T are primarily irritative to the 
skin and respiratory tract during acute exposures and 
work by different mechanisms. Some substances, such as 
paraquat, are highly corrosive and can cause multisystem 
injury and progressive pulmonary failure [34]. 

Arsenical pesticides, such as copper chromium 
arsenate, have been used, until recently, as wood pre¬ 
servatives to prolong the useful life of exterior wooden 
structures. These compounds cause central nervous 
system depression at sufficient doses. 


Pesticides are ubiquitous in the environment. They 
are found in food, water, homes, schools, workplaces, 
lawns, and gardens. They are present in soils that have 
been spread with pesticides or where pesticides from 
adjacent agricultural areas have drifted, and may reach 
water supplies from agricultural runoff. In the USA 
alone, more than 0.45 billion kilograms of pesticides 
are applied each year, in agriculture, in homes and 
gardens, and in schools and hospitals. In developing 
countries, many highly toxic and biologically persistent 
pesticides such as parathion, DDT, and paraquat, 
which are no longer permitted in many developed 
countries, are still in wide use and result in chronic 
exposures and acute, too often fatal poisonings of 
thousands of young children each year. 

Most children in the world are exposed to some 
degree to pesticides. Children in rural and agricultural 
areas and especially children whose parents are farm¬ 
workers or pesticide applicators are at highest risk of 
having increased exposures to pesticides. Pesticides 
may reach their homes due to the drifting of pesticides 
that are applied to the ground through aerial spraying. 
Children may work or play near their parents in the 
fields where pesticides have been used. Parents who 
work with pesticides may bring pesticides to their 
homes, impregnated in their clothes and bodies. In 
countries where residential housing with gardens and 
lawn predominate, homes and garden pesticide use 
may result in significant levels of exposure [34]. 

Exposure of children to pesticides may occur 
through inhalation, ingestion, and dermal absorption. 
Ingestion of pesticides occurs either through accidental 
exposure due to pesticides stored in food containers 
(i.e., soft drink bottles), or through ingestion of pesti¬ 
cide-treated foods, particularly fruits and vegetables. 
Foods grown in pesticide-contaminated soils and fish 
from pesticide-contaminated water can also carry sig¬ 
nificant amounts of pesticides. Children may also 
ingest pesticides adhered to the surface of toys or 
other objects through their hand-to-mouth behavior. 
The potential of dermal exposure of children to pesti¬ 
cides is higher than that of adults because of their 
relatively large body surface area and extensive contact 
with lawns, gardens, and floors by crawling and playing 
on the ground. 

Prenatal and early childhood exposures are of 
special concern because of the susceptibility of the 


E 




3632 


E 


Environmental Toxicology: Children at Risk 


developing organ systems to pesticides (the central 
nervous system in particular) as well as behavioral, 
physiological, and dietary characteristics of children. 
Breast-feeding infants may ingest pesticides or 
pesticide metabolites present in the breast milk. 
The quantity of pesticide that is passed to the infant 
via breast milk is influenced by many variables such as 
maternal age and parity, maternal body burden of the 
chemical, and breast-feeding patterns. As infants are 
weaned and progress to solid foods, they consume, per 
unit of body weight, proportionally more fruit and 
more fruit juice than adults. The NAS in 1993 reported 
that children s dietary exposures to pesticides differed 
from adults both quantitatively and qualitatively 
and questioned the protection provided to infants 
and children from pesticide tolerances in effect at the 
time. The report estimated that 50% of lifetime pesti¬ 
cide exposure occurs during the first 5 years of life [2]. 

Environmental Tobacco Smoke 

Environmental tobacco smoke (ETS), also known as 
second-hand smoke, is exhaled smoke and sidestream 
smoke emitted from the burning of the tip of the 
cigarette. The inhalation of ETS is known as “involun¬ 
tary smoking” or “passive smoking.” ETS contains 
more than 4,000 different chemical compounds, 
many of which are toxic. In 1992, the US Environmen¬ 
tal Protection Agency (EPA) declared ETS as a Group 
A carcinogen. 

The effects of passive smoking begin in utero, where 
constituents of tobacco smoke, such as PAHs, nicotine, 
and carbon monoxide, cross the placenta and are 
concentrated in the fetal circulation [35]. Children are 
also exposed during childhood if any of the parents 
smoke. 

Persistent Organic Compounds: PCBs, Dioxins, 
and Related Organohalogens 

Polychlorinated biphenyls are synthetic compounds 
with two linked phenyl rings and variable degrees of 
chlorination. They have been used for many years 
because of their thermal and chemical stability. They 
are nonvolatile, hydrophobic oils that are not easily 
biotransformed in the environment or metabolized by 
the human organism, so they are very persistent in the 
environment, and bioaccumulate in the food chain 


and in the fat compartment of the human body. 
Although they have been banned in the USA for more 
than 30 years, they are still widely present in the 
environment. They have been found in wildlife, 
human tissue, and human milk. Polychlorinated 
dibenzofurans (PCDFs) are partially oxidized PCBs 
that appear as contaminants of PCBs. Polychlorinated 
dibenzodioxins (PCDDs), commonly referred as 
dioxins, are formed during paper bleaching and 
waste incineration. One dioxin congener, 
2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD), is con¬ 
sidered to be the most toxic synthetic chemical known. 

The most significant source of exposure is contam¬ 
inated food, particularly fish at the top of their trophic 
chain. Of greatest concern are the populations that 
consume high amounts of fish. Because PCBs and 
PCDFs are not metabolized or excreted, they can accu¬ 
mulate in the fat tissue of the body, as well as in human 
milk, resulting in high exposures of the developing 
fetus and the newborn. 

Disinfection By-products (DBPs) 

Disinfection by-products (DBPs) form when disinfec¬ 
tants are added to drinking water and react with natu¬ 
rally occurring organic matter. Chlorine, the most 
widely used primary disinfectant, reacts with naturally 
occurring organic matter to form a range of unwanted 
by-products such as the trihalomethanes (THMs) 
which include chloroform, bromodichloromethane 
(BDCM), chlorodibromomethane (DBCM), and the 
haloacetic acids (HAAs), such as monochloroacetate, 
dichloroacetate, and trichloroacetate. 

Exposure to DBPs occurs through ingestion of 
water or through inhalation and absorption during 
showering, bathing, and swimming. While there is 
some concern that these chemicals may pose a health 
risk, the potential risks arising from not treating drink¬ 
ing water are considerably greater, and the disinfection 
of water should never be compromised as a result. 

Environmental Threats to Children on Specific 
Organ Systems 

Nervous System 

The developing nervous system is more susceptible 
than the adult brain to the disrupting effect of toxic 
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chemicals [5]. The lengthy period of brain develop¬ 
ment and the extensive number of processes needed 
to take place contribute to the susceptibility of the 
developing nervous system. In the 9 months of preg¬ 
nancy, the human brain and spinal cord must develop 
from a thin strip of cells along the back of the 
embryo into a complex organ comprised of billions of 
precisely located, highly interconnected, and special¬ 
ized cells. Brain development requires that neurons 
move along precise pathways from their points of 
origin to their assigned location, that they establish 
connections with other cells near and distant, and 
that they learn to intercommunicate. Each connection 
between and among neurons must be precisely 
established at a particular point in development, and 
redundant connections need to be pruned away 
through programmed cell death, apoptosis. All these 
processes must take place within a tightly controlled 
time frame, in which each developmental state must 
be reached on schedule and in the correct sequence. 

Critical windows of vulnerability, which exist only 
in the 9 months of pregnancy and to a lesser extent in 
early childhood, are unique to early brain development. 
They have no counterpart in adult life. Exposure to 
toxic chemicals during these windows of vulnerability 
can cause devastating damage to the brain and nervous 
system. 

Any toxic or other environmental exposure that 
interferes with the tightly orchestrated sequence of 
events involved in brain formation is likely to have 
profound effects on intellect, behavior, and other func¬ 
tions. If a developmental process in the brain is halted 
or inhibited, if cells fail to migrate the proper sequence 
to their assigned locations, if synapses fail to form, or if 
pathways are not established, there is only limited 
potential for late repair, and the consequences can be 
permanent [36]. 

Environmental toxicants can affect both the 
structural and functional development of the nervous 
system. Depending on the developmental stage at 
which exposure occurs, sensory development, intelli¬ 
gence, or behavior will be affected differentially. While 
early-developing neural systems have been considered 
the most vulnerable to chemical insult, scientists have 
called attention to the importance of chemical expo¬ 
sure that occurs late in childhood, as it has been 
recently suggested that behavioral and physiological 


foundations of cognition continue to develop during 
childhood and adolescence [5, 37]. 

Exposure to environmental toxicants such as lead, 
methylmercury, and certain pesticides and PCBs even 
at very low levels have been shown to produce 
neurobehavioral (functional) deficits, and increased 
susceptibility to neurodegenerative diseases much 
later in life [38]. Of critical concern is the possibility 
that developmental exposure to neurotoxicants may 
result in an acceleration of age-related decline in func¬ 
tion that could lead to Parkinson Disease, Alzheimer 
Disease, and other forms of brain degeneration. This 
concern is compounded by the fact that developmental 
neurotoxicity that results in small effects on the 
individual at a particular stage can have a profound 
societal impact when considered in the whole popula¬ 
tion or across the life span of the individual [36]. For 
example, a five-point decline in average population 
intelligence (IQ score) can result in reduction by 
more than 50% in the number of children with supe¬ 
rior IQ (>130) and concomitant doubling in the num¬ 
ber of children with IQ scores in the retarded range 
(<70). Behavior is also disrupted, sometimes perma¬ 
nently, by such exposures. 

The following sections will give a short overview of 
some neurotoxicological effects of the best known 
chemicals for their effect on the developing nervous 
system. Some toxicants such as ethanol have been 
excluded, as the focus of this chapter is on exposures 
to toxicants to which the majority of children are 
exposed. 

Lead Lead is one of the best studied toxicants and one 
of the few pollutants for which susceptibility of 
children has been clearly established. The best known 
health effects on children are its neuropsychological 
effects although other effects have also been studied 
and are partially documented. 

The neurotoxic effects of lead in children have been 
extensively studied. One distinctive characteristic of 
the research findings on lead neurotoxicity over the 
past 2-3 decades is that it has led to a progressive 
decline in the LOAELs (lowest observed adverse effect 
levels). The early finding in the 1970s by Landrigan 
et al. [39] and Needleman [40] that low-dose exposure 
to lead is associated with a significant decrease in intel¬ 
ligence quotient (IQ) has been confirmed in many 


E 




3634 


E 


Environmental Toxicology: Children at Risk 


studies. The recommended action level of blood lead 
levels established by the Centers for Disease Control is 
10 pg/dL of blood. However, epidemiological studies 
during the last decade have found strong evidence 
for cognitive deficits among school-aged children at 
blood lead levels below the current CDC action level 
[41, 42]. Further, in a pooled analysis of seven prospec¬ 
tive longitudinal studies, the average IQ deficit associ¬ 
ated with an increase in concurrent blood lead 
concentration from <1 to 10 pg/dLwas about threefold 
that for an increase from 10 to 20 pg/dL [42]. Birth 
cohort studies have shown inverse dose-response rela¬ 
tionships between transplacental lead exposure indices 
and central auditory processing indices among 
infants and children at maternal blood lead levels 
below 10 pg/dL [43]. 

Research has also demonstrated a link between 
developmental lead exposure and behavioral outcome. 
In a prospective study, the behavior of lead-exposed 
children at 8 years of age was significantly related to 
tooth dentine levels [44], suggesting that social and 
emotional difficulties correlate with lead exposure. In 
another prospective, longitudinal study, both prenatal 
and postnatal lead exposure was related to antisocial 
and delinquent behavior in adolescents [45]. It is gen¬ 
erally accepted in the scientific and medical community 
that the adverse neurobehavioral consequences of lead 
are not reversible and remain in place across the life 
span [41]. Further, the possibility that a threshold 
level for the effects of lead does not exist has been 
suggested [42]. 

Mercury Methylmercury is a well-established neuro- 
toxicant that can cause serious adverse effects on the 
development and functioning of the human central 
nervous system, especially when exposure occurs pre- 
natally. The well-known episodes of community-wide 
poisoning in Japan and Iraq revealed the particular 
sensitivity of the fetus to the toxic effects from mercury 
exposure. In these communities, pregnant women 
exposed to methylmercury and who themselves had 
no or minimal symptoms, had babies with devastating 
neurological handicaps, including delayed attainment 
of developmental milestones, blindness, deafness, and 
cerebral palsy. 

At levels of exposure lower than those encountered 
in the Minamata Bay, there is limited epidemiological 


evidence of neuropsychological effects. A birth cohort 
of 1,000 children was established at the Faroe Islands in 
1986-1987, and the methylmercury exposure was 
determined from the mercury concentration in the 
cord blood [46]. More than 90% of these children 
were then examined at age 7 years. Neuropsychological 
effects in the areas of language, attention, and memory 
and to a lesser extent in visuospatial and motor func¬ 
tions were observed [46]. In Brazil, cross-sectional 
studies of Amazonian children aged 7-12 years show 
mercury-associated effects in agreement with the Faroe 
findings [47] . However, a large cohort study conducted 
in the Seychelles did not reveal consistent associations 
between perinatal methylmercury exposure indices and 
developmental milestones or neuropsychologic test 
scores up to age 5 years old [48]. 

Some studies have found an association between 
central auditory processing deficits and current child¬ 
hood hair mercury levels as low as 5 pg/g [49]. These 
findings are supported by animal studies, showing that 
transplacental or postnatal methylmercury exposure 
affects auditory systems at the cortical level [50]. 

The important question is to which degree these 
findings relate to fish-consuming populations in gen¬ 
eral. Although this question cannot be answered with 
any confidence at this time, it looks like the 
recommended one to two fish meals per week during 
pregnancy would be very unlikely to cause any risk to 
the fetus, unless the seafood is severely contaminated. 
The National Academy of Sciences recommended that 
a limit of about 0.1 pg/kg of body weight per day 
should not be exceeded by pregnant women [51]. 

PCBs, Dioxins, and Related Organohalogens 

Polychlorinated biphenyls are known to interfere with 
thyroid hormones, some of which are critical for nor¬ 
mal brain development, and this effect of PCBs on the 
thyroid function of the newborn has been postulated 
as the mechanism of action of neuropsychological 
effects [52]. 

The earliest evidence of PCB-related neurotoxicity 
comes from the poisoning episodes in Japan (Yusho) 
and Taiwan (Yucheng) where people became ill after 
ingesting rice oil that was highly contaminated with 
PCBs. Infants born to mothers who consumed PCB- 
contaminated rice oil during pregnancy were at 
increased risk for low birth weight, abnormal brown 
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pigmentation of the skin, and clinical abnormalities of 
the gingival, skin, nails, teeth, and lungs [53]. In addi¬ 
tion, children of both cohorts had various 
neurobehavioral deficits such as delayed attainment of 
developmental milestones, lower scores on intelligence 
tests, and higher activity levels [54]. Children were 
followed and examined 6 years later and showed per¬ 
sistent behavioral abnormalities and ectodermal 
defects [55]. 

Many studies after the Taiwan event provide 
evidence of neuropsychological effects of PCBs [56]. 
The evidence indicates that PCBs cause 
neurobehavioral deficits in children who are exposed 
prenatally, while the evidence of effects of exposure to 
background levels commonly found in the general 
population is not conclusive. Studies support the 
association between low-level transplacental PCB 
exposure and childhood deficits in cognitive, psycho¬ 
motor, memory, language, and attention functions 
[56, 57]. None of the studies up to date has been able 
to document an adverse effect of PCBs exposure from 
breast-feeding. It has been postulated that the larger fat 
compartment of the nursing infant than the fetus may 
make it possible to dilute somewhat the lipid-soluble 
contaminants absorbed through human milk. Thus, 
despite the occurrence of these contaminants in 
human milk, the advantages of breast-feeding for 4-6 
months apparently override any limited neurotoxin 
damage due to the contaminants. Still, of particular 
concern would be the populations who frequently eat 
contaminated fish or who reside in contaminated areas 
and may still be exposed at levels that have been asso¬ 
ciated with adverse effect. 

Pesticides Current knowledge about the neurotoxi- 
cological effects of pesticides is limited, despite the 
extensive knowledge available of the mechanisms of 
pesticide toxicity in animals. One observational study 
in children from a region in Mexico with intensive 
pesticide use found a variety of developmental delays 
compared with otherwise similar children living in 
a region where the population had not adopted 
a pesticide-based agriculture. The children were similar 
in growth and physical development, but significant 
delays were noted among the exposed children in phys¬ 
ical stamina, gross and fine hand-eye coordination, 
short-term memory, and ability to draw a human 


figure [58]. However, these conclusions have been 
questioned, as pesticide levels were not reported for 
the individual children who received neuropsycholog¬ 
ical testing. 

Fetal and neonatal animals are often more sensitive 
than adults to the neurotoxic effects of some organo- 
phosphates [2], and levels presumed to be nontoxic in 
adults may not be adequately protective of the devel¬ 
oping organisms. For example, the young rat is defi¬ 
cient in the actions of two enzymes that detoxify 
chlorpyrifos, and young rats have increased sensitivity 
to chlorpyrifos toxicity [59]. In rats, chlorpyrifos 
appear to affect cholinergic function [60]. Infants 
may be particularly vulnerable to reductions in brain 
cholinesterase given that acetyl choline plays an impor¬ 
tant role in normal brain development and plasma and 
erythrocyte (and therefore probably brain) cholinester¬ 
ase do not reach adult values until 6-12 months of 
age [61]. 

Animal studies suggest that exposure to pesticides 
such as DDT and its metabolites DDE and dichlorodi- 
phenyldichloroethane at the levels found in the envi¬ 
ronment affect the developing brain. Ten-day old mice 
treated once with DDT showed behavioral changes 
compared with controls when tested at 4 months of 
age. These effects occur only after DDT dosing at 
10 days of age and not after similar dosing at 3 or 
19 days of age [62]. 

In spite of the absence of direct conclusive evi¬ 
dence of the neuropsychological effects of pesticides, 
the known subclinical effects observed in adults 
through neuropsychological testing, and what is 
known on the mechanism of action of pesticides 
and susceptibility of the developing brain, has led 
many investigators to infer that chronic low-dose 
exposure to certain pesticides might pose a potential 
hazard to the health and development of infants and 
children. Other investigations have concluded such 
inferences can be neither supported nor refuted at 
the present time. 

Endocrine System 

The endocrine system is one of the body’s main com¬ 
munication networks and is responsible for controlling 
and coordinating numerous body functions. Hor¬ 
mones are first produced by the endocrine tissues, 
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such as the ovaries, testes, pituitary, thyroid, and pan¬ 
creas, and then secreted into the blood to act as the 
body’s chemical messengers. They direct communica¬ 
tion and coordination among other tissues throughout 
the body, and they exert a powerful influence over 
growth and development of every organ system in 
the body. 

The endocrine system regulates metabolic, nutri¬ 
tional, reproductive, and behavioral processes, as well 
as growth, responses to stress, and the function of the 
digestive, cardiovascular, renal, and immune system. 
Programming of endocrine set points is a unique aspect 
of endocrine system development that takes place 
during fetal/neonatal development, and exposure to 
toxicants during this critical period of programming 
can result in permanent abnormalities in endocrine 
function [5]. Disruption of endocrine function can 
have severe health consequences in adults, and 
exposures that interfere with the development of the 
endocrine system during early life stages can have even 
more far-ranging consequences [63]. 

Endocrine disruptors are natural compounds or 
man-made chemicals that may alter the production 
or activity of hormones of the endocrine system leading 
to adverse health effects. Although there is limited 
scientific information on the potential adverse human 
health effects, concern arises because endocrine 
disrupting chemicals, while present in the environment 
at very low levels, have been shown to have adverse 
effects in wildlife species, as well as in laboratory ani¬ 
mals at these low levels. Many of these chemicals have 
been linked with developmental, reproductive, neural, 
immune, and other problems in wildlife and laboratory 
animals. The potential adverse effects of EDCs include 
neurodevelopmental and neurobehavioral abnormali¬ 
ties, reproductive disorders such as declined fertility, 
immune impairment, and certain hormone-related 
cancers [1]. 

An example of a phthalate is di(2-ethylhexyl) 
phthalate (DEHP). DEHP is a high-production- 
volume chemical used in the manufacture of a wide 
variety of consumer food packaging, some children’s 
products, and some polyvinyl chloride medical devices. 
Recently, an independent panel of experts assembled by 
the National Toxicology Program (NTP) found that 
DEHP may pose a risk to human development, espe¬ 
cially critically ill male infants. 


In 2000, an independent panel of experts convened 
by the NIEHS and the National Toxicology Program 
(NTP) found that there was “credible evidence” that 
hormone-like chemicals can affect test animals’ bodily 
functions at very low levels - well below the “no effect” 
levels determined by traditional testing. Although there 
is little evidence to prove that low-dose exposures are 
causing adverse human health effects, there is a large 
body of evidence in experimental animals and wildlife 
suggesting that endocrine disruptors may cause [1]: 

• Reductions in male fertility and declines in the 
numbers of males born 

• Abnormalities in male reproductive organs 

• Female reproductive diseases including fertility 
problems, early puberty, and early reproductive 
senescence 

• Increases in mammary, ovarian, and prostate cancers 

Endocrine disruptors may interfere with the endo¬ 
crine system through several mechanisms. Some mimic 
or partially mimic occurring hormones in the body like 
estrogens and androgens and thyroid hormones, 
potentially producing overstimulation. Another group 
of natural and synthetic substances interferes with the 
hormones at receptors. Substances that compete with a 
hormone at the receptor, and imitate its effect are called 
agonists, those that block the receptor are antagonists. 
Other chemicals interfere, or block the way natural 
hormones or their receptors are made or controlled, 
for example, by blocking their metabolism in the liver. 
Environmental chemicals with estrogenic activity are 
probably the most well studied; however chemicals 
with antiestrogen, androgen, antiandrogen, progester¬ 
one, or thyroid-like activity have also been identified. 

A wide range of substances are thought to cause 
endocrine disruption. Chemicals that are known endo¬ 
crine disruptors include diethylstilbestrol (DES), 
dioxin and dioxin-like compounds, PCBs, DDT, and 
some other pesticides. Some chemicals, particularly 
pesticides and plasticizers, such as Bisphenol A are 
suspected endocrine disruptors based on animal 
studies. 

Phthalates, a class of chemicals that soften and 
increase the flexibility of polyvinyl chloride plastics, 
are considered to be endocrine disruptors on the 
basis of extensive animal research plus emerging 
studies in humans. An example of a phthalate is 
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di(2-ethylhexyl) phthalate (DEHP). DEHP is a high- 
production-volume chemical used in the manufacture 
of a wide variety of consumer food packaging, some 
children s products, and some polyvinyl chloride med¬ 
ical devices. Recently, an independent panel of experts 
assembled by the National Toxicology Program (NTP) 
found that DEHP may pose a risk to human develop¬ 
ment, especially critically ill male infants. 

Research shows that endocrine disruptors may pose 
the greatest risk during prenatal and early postnatal 
development when organ and neural systems are devel¬ 
oping. In animals, adverse consequences, such as 
subfertility, premature reproductive senescence, and 
cancer, are linked to early exposure, but they may not 
be apparent until much later in life. 

There is some evidence that endocrine disruptors 
may not only impact the individual directly exposed 
but also future generations. It has been found that 
animals exposed to low doses of the natural human 
estrogen estradiol, or the environmental estrogen 
bisphenol A (BPA) (a chemical used in great quantities 
in the production of polycarbonates and epoxy resins), 
during fetal developmental and estradiol as adults were 
more likely to develop a precursor of prostate cancer 
than those who were not exposed [64]. This suggests 
that exposure to environmental and natural estrogens 
during fetal development could affect the way prostate 
genes behave, and may lead to higher rates of prostate 
disease during aging. It has also been shown that the 
adverse effects of diethylstilbestrol in mice can be 
passed to subsequent generations even though they 
were not directly exposed [64]. The increased suscep¬ 
tibility for tumors was seen in both granddaughters 
and grandsons of mice who were developmentally 
exposed to DES [65]. One study found that endocrine 
disruptors caused fertility defects in male rats that were 
passed down to nearly every male in subsequent gen¬ 
erations [66]. These intergenerational effects may be 
the consequence of epigenetic changes caused by endo¬ 
crine disruptors. 

There is concern that alterations in thyroid hor¬ 
mone signaling by endocrine disruptors during fetal 
and neonatal development could disrupt central ner¬ 
vous system development. There have been several 
epidemiological studies of the neurobehavioral effects 
of in utero exposure to PCBs that have been mentioned 
in the section on nervous system effects. Thus far, 


few studies have directly linked neurobehavioral 
effects of exposure to polyhalogenated hydrocarbons 
to disruption of thyroid hormone signaling. One 
example is a study that showed that low-frequency 
hearing loss caused by developmental exposure in rats 
to PCBs could be partially reversed by replacement of 
T4 [67]. In animals, many polyhalogenated aromatic 
hydrocarbons such as dioxins and PCBs alter thyroxin 
levels via an increased metabolism and excretion of 
these hormones [52]. 

Perchlorates are another class of environmental 
chemicals that affect thyroid function via inhibition 
of iodine uptake by the thyroid gland, reducing T4 
and T3 synthesis. There has been a concern that con¬ 
tamination of drinking water with perchlorates from 
industrial sites would suppress fetal thyroid hormone 
synthesis, disrupting central nervous system develop¬ 
ment. Increased rates of congenital hypothyroidism 
have been found in communities with detectable per¬ 
chlorate levels in the drinking water [68], although 
others have not detected such elevated rates [69]. 

Evidence from epidemiological studies demon¬ 
strates that exposures during early life can result in 
greater susceptibility to diabetes and obesity later in 
life [70]. In the course of development of the pancreas 
and pancreatic function, for example, several windows 
of susceptibility have been identified [71]. Studies 
about the Dutch Famine Winter have shown that 
poor maternal nutrition, especially during the last tri¬ 
mester of pregnancy had an impact on glucose toler¬ 
ance and insulin resistance, and that in terms of obesity, 
those individuals born to mothers who were exposed to 
the famine around the first half of pregnancy were 
more obese than those of mothers exposed during the 
last trimester [72]. A number of subsequent studies 
have documented the role of the intrauterine environ¬ 
ment, and low birth weight in the development of 
diabetes [71, 73] and other metabolic disorders. 

Reproductive System 

The development of the reproductive system is a long 
process that takes place from the beginning of gesta¬ 
tion, when organ development starts to take place, 
to maturation of the reproductive system during 
puberty. This long developmental period provides for 
several windows of susceptibility. Adverse effects of 
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reproductive toxicants can become manifest at birth 
(e.g., hypospadias and cryptorchidism in humans), in 
puberty (as delay or precocity), or in adulthood (e.g., 
infertility, alterations in accessory sex organs, distur¬ 
bances in pregnancy maintenance, endometriosis, or 
premature reproductive senescence) [74]. 

The developmental susceptibility of the reproductive 
system was clearly demonstrated through the diethylstil- 
bestrol (DES) exposure. In utero exposure of men to 
DES has been linked to increased incidence of meatal 
stenosis, epididymal cysts, testicular hypoplasia, crypt¬ 
orchidism, microphallus, and sperm abnormalities. In 
females, DES exposure resulted in adenosis, clear cell 
adenocarcinoma, and structural defects of the cervix, 
vagina, uterus, and fallopian tubes [75]. 

Environmental tobacco smoke has been associated 
with decreased fecundity and earlier menopause in 
women smokers [76]. Women whose mothers smoked 
while pregnant also had reduced fecundity compared 
with women whose mothers did not smoke [ 13 ]. In men 
whose mothers smoked tobacco during pregnancy, 
reduced semen quality, smaller testis size, and reduced 
fecundability odds ratios have been observed [77]. 

Phthalates such as diethylhexyl phthalate (DEHP) 
are developmental toxicants in experimental animals. 
The organ system most sensitive to phthalates is the 
reproductive tract of immature males [78]. In utero 
exposure of male rats to some phthalate esters results 
in changes in the reproductive tract, such as decreased 
anogenital distance, hypospadias, cryptorchidism, dis¬ 
turbed development of prostate, epididymis, vas 
deferens, and seminal vesicles, retained nipples and 
decreased sperm production [79-82]. In humans, sim¬ 
ilar dysgenetic changes in the histology of the testis 
have been found in patients with testicular cancer, 
subfertility, or cryptorchidism [83, 84]. It has been 
hypothesized that all these human disorders (testicular 
germ cell cancer, cryptorchidism, hypospadias, and low 
sperm counts) have common origins in fetal life and 
that they all represent different symptoms testicular 
dysgenesis syndrome [87]. 

Neonatal Mortality, Growth Restriction, and Birth 
Defects 

Birth defects are normally defined as structural prob¬ 
lems in the newborn, attributable to faulty 


development or deformation, but defects in function, 
metabolism, or body chemistry that lead to physical or 
mental problems or to death may also be considered 
birth defects. The broader term “developmental disor¬ 
ders” is generally used when considering all effects 
observed on the conceptus from fertilization to sexual 
maturity. Developmental disorders include both struc¬ 
tural birth defects and functional defects, such as blind¬ 
ness, deafness, or neurobehavioral disabilities. 

The majority of birth defects are considered the 
result of multiple environmental and/or genetic causes 
acting together. Environmental toxicants studied for 
their relation with birth defects include maternal 
smoking and alcohol use, pesticides, disinfection by¬ 
products, plastics and plastic components, solvents, 
metals and numerous air pollutants [88]. Other envi¬ 
ronmental causes such as maternal diseases (e.g., 
rubella), and use of pharmaceuticals (e.g., valproic 
acid, an anticonvulsant, and mood stabilizers), have 
served to document the potential environmental role 
in developmental birth defects. 

Many epidemiological studies have attempted to 
evaluate whether some chemical exposures are linked 
to increased rates of spontaneous abortions. For exam¬ 
ple, occupational exposure of the mother to organic 
solvents has been associated with spontaneous abor¬ 
tion in several studies [89, 90]. A review of six occupa¬ 
tional studies found suggestive evidence for an 
association between toluene exposure and spontaneous 
abortion, although most workers were simultaneously 
exposed to multiple chemicals, which may have con¬ 
founded the interpretation of results. According to this 
review, spontaneous abortion has generally not been 
observed as a major problem among highly exposed 
women who abuse toluene during pregnancy. 

Many studies have investigated the effects of disin¬ 
fection by-products (DBPs) on the developing fetus and 
children. Review of epidemiologic studies reveal that 
there is fairly consistent evidence for associations 
between early and late fetal deaths and indices of trans¬ 
placental DBP exposure [25, 26, 89]. Associations 
between late fetal deaths and drinking water chloro¬ 
form, bromodichloromethane, and total THM concen¬ 
tration at levels below the Canadian drinking water 
guideline of 100 pg/L have been found in Canada [90]. 
In England, increased risks of late fetal deaths among 
women living in regions with THM concentrations 
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above 30 pg/dL [91]. These findings are supported 
by animal studies, where high prenatal maternal expo¬ 
sure to chloroform, bromodichloromethane (BDCM), 
haloacetonitriles, or haloacetic acids produced fetal 
resorption and reduced fetal survival [26]. 

Evidence on the effects of maternal DBP exposure 
and SGA (small-for-gestational-age) effects is mixed. 
Recent reviews have found limited evidence for associ¬ 
ation with this effect [25, 26], but a cohort study in 
Massachusetts demonstrated associations between 
SGA and drinking water trihalomethanes and muta¬ 
genic activity levels during the third trimester in the 
town of maternal residence [92]. In experimental ani¬ 
mals, high prenatal maternal exposure to chloroform, 
BDCM, haloacetonitriles, or haloacetic acids reduced 
fetal weight [26, 27]. 

The evidence of the effects of chlorination disinfec¬ 
tion by-products in drinking water and birth defects is 
inconclusive. Some studies have shown an association 
between neural tube defects and prenatal maternal 
DBP [25, 26]. The evidence of effects of water disinfec¬ 
tion from five previous studies with a cross-sectional 
study of all Taiwanese births in years 2001-2003 was 
reviewed [93] and evidence of an effect of exposure to 
chlorination by-products on the risk of neural tube 
defects, urinary system defects, and ventricular septal 
defects was concluded. However, a recent review of 
a small number of recent studies reported inconsistent 
results for an association between drinking water 
chlorination by-products and risk of al congenital 
anomalies combined and of specific groups of anoma¬ 
lies, and there was little evidence of an exposure- 
response relationship [94]. 

Maternal smoking during pregnancy increases the 
risk of pregnancy loss, stillbirth, and infant mortality 
[28]. There is some strong evidence that exposure to 
ETS increases the probability of preterm birth [95]. 
A large birth cohort study in California found border¬ 
line statistical association between maternal serum 
cotinine concentrations during early pregnancy and 
late fetal deaths (borderline statistical significance), 
preterm birth, and a significant association with 
reduced birth weight (significant) [24]. 

The association between sudden infant death 
syndrome (SIDS) and maternal smoking has been 
firmly established. As reported by Wigle et al. [98], 
a meta-analysis of 39 epidemiological studies and 


an expert panel review concluded that there is suffi¬ 
cient evidence of a causal association between sudden 
infant death syndrome and postnatal ETS exposure, 
independent of prenatal maternal active smoking 
[95]. Sudden infant death syndrome was also associ¬ 
ated with paternal smoking, even when the mothers did 
not smoke. 

Active smoking by mothers has been shown to sig¬ 
nificantly reduce the rate of fetal growth, and the effect 
was shown to be dose-dependent. DiFranza and Lew 
[97] using data from 23 studies, calculated an odds 
ratio of 1.82 for the association between maternal 
smoking and low birth weight (<2,500 g). The risk of 
intrauterine growth retardation caused by maternal 
smoking appears to increase with maternal age, from 
twofold for mothers aged 17 years to fivefold for 
mothers aged 35 years [98]. Although smoking 
throughout pregnancy is known to affect birth weight, 
there is some evidence that the final trimester may be 
particularly important [99]. In some studies, babies of 
smoking mothers have been reported to be shorter and 
to have smaller head circumferences at birth than 
babies of nonsmoking mothers [98]. 

Studies on pesticide exposures and birth defects 
have found evidence of associations between paternal 
pesticide exposure and cryptorchidism [99], although 
other studies have failed to find a similar association. 
Cryptorchidism was associated also with maternal 
serum DDT/DDE and hexachlorobenzene levels [100] 
and with adipose tissue or maternal serum DDE con¬ 
centrations [101]. In animal studies, trans-placental 
exposure to the pesticides DDT/DDE vinclozolin, 
procymidone, or linuron, produce hypospadias, crypt¬ 
orchidism, and other abnormalities [102]. The epide¬ 
miological evidence of the link between pesticide 
exposure and hypospadias is not conclusive enough 
to affirm a link between pesticide exposure and this 
type of birth defect. 

Respiratory System 

Lung development is a continuous process from embry¬ 
onic life to adolescence. At birth, about 85% of the 
human alveoli are present. Alveolar number and lung 
surface area increase through childhood and begin to 
level off between 2 and 4 years of age, whereas 
lung expansion continues up to 8 years of age [5]. 
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Immature (neonatal) differentiating cells of the respira¬ 
tory tract are more sensitive to injury following exposure 
to respiratory toxicants than mature cells, and at dose 
levels that cause no effects in adult cells. Children are 
usually physically active, and have greater exposure to air 
pollutants. Because of their higher metabolic rates, they 
breathe more rapidly and inhale more pollutants per 
kilogram of body weight than adults. Their narrower 
airway passages compared to adults can be more easily 
obstructed in a greater proportion than in adults. Thus, 
because of the immaturity of the lungs during childhood 
and the greater exposures relative to body weight in 
children than in adults, it is inferred that children may 
be more susceptible to the effects of respiratory toxicants 
than adults, whose lung growth is complete. 

Air pollution has been extensively studied in rela¬ 
tion to respiratory health and in general it has been 
easier to demonstrate the effects of air pollution on 
exacerbation of certain conditions than on the causing 
of disease in previously normal individuals. Both out¬ 
door and indoor air pollution have been identified as 
potential risk factors for both the initiation/induction 
and the exacerbation of respiratory diseases, especially 
asthma. Evidence of the effect of air contaminants 
such as ozone and ETS on lung function has been 
demonstrated in both animal and human studies. Air 
pollution is now linked to SIDS [103]. 

Indoor Air Pollution Indoor air pollution is the 
most important source of respiratory exposure to tox¬ 
icants in children, given the fact that children spend up 
to 90% of their time indoors and the large range of 
pulmonary irritants that can be found in the home. 
Exposure to pollutants in the home environment in 
developed countries has increased with improved insu¬ 
lation and reduced ventilation and the use of chemical 
detergents and building or furnishing constituents that 
contain noxious pulmonary irritants [5]. Infants and 
young children in particular have little control over 
their exposure to pollutants in the home environment 
and are vulnerable to the activities of the adults 
(particularly ETS). Common indoor air pollutants 
include nitrogen dioxide, formaldehyde and other 
volatile organic compounds (VOCs), and ETS. 

Environmental Tobacco Smoke Environmental 
tobacco smoke has been extensively studied in relation 


to children s respiratory health. The developing fetus 
can be involuntarily exposed to tobacco during preg¬ 
nancy through a smoking mother or by the pregnant 
mother’s exposure to environmental tobacco smoke. 
Exposure may continue throughout childhood if any 
of the parents smoke. The early exposures may have 
persistent adverse effects throughout life. 

There is evidence of ETS effects on the increased 
risk of lower respiratory illness rates, especially in the 
first year of life, increased rates of chronic middle ear 
effusion in children, impairment of lung function and 
exacerbation of certain conditions such as asthma. 
Exposure to environmental tobacco smoke is a risk 
factor for sudden infant death syndrome. Children 
exposed to ETS are more likely to suffer from respira¬ 
tory illness (bronchitis, pneumonia) and to be hospi¬ 
talized because of the illness than unexposed children 
[ 103] . EPA estimates that between 150,000 and 300,000 
of bronchitis and pneumonia cases annually in infants 
and young children up to 18 months of age are attrib¬ 
utable to exposure to ETS. Of these, between 7,500 and 
15,000 will result in hospitalization. Furthermore, there 
is evidence that exposure to ETS leads to increased 
infant mortality from respiratory illness as well as 
increased morbidity [104]. 

Exposure to pollutants during critical periods of 
lung development may have effects that would not be 
seen if exposure occurred during adulthood [105]. 
Maternal smoking during pregnancy was related to 
impaired lung function in newborn infants [106]. 
Permanent effects of parental smoking on lung func¬ 
tion were found in adults of 30-59 years of age [107], 
indicating that the impairment of lung function 
persists through life. 

The exacerbating effect of ETS on asthma has been 
known for some time, and many studies evidence this 
effect. ETS exposure increases the frequency of episodes 
and severity of symptoms and the rates of hospitaliza¬ 
tion in asthmatic children. In addition, ETS exposure is 
a risk factor for new cases of asthma in children who 
have not previously displayed symptoms (EPA http:// 
www.epa.gov/smokefree/healtheffects.html) [69] . 

Another group of environmental agents that have 
been studied in relation to respiratory health are the 
bioaerosols. Bioaerosols include inhaled allergens 
(house dust mites and other insects, molds, pets, 
pollens) and bacterial (lipopolysaccharide) and 
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fungal (glucans) products. Sensitization to one or more 
common inhalant allergens is consistently associated 
with childhood asthma especially in developed coun¬ 
tries [107]. An expert panel concluded that house-dust 
mite allergens produce incident (new onset) asthma 
and that cat, cockroach, and house-dust mite allergens 
induce episodes in sensitized asthmatics [96] . However, 
the relationship between exposure to inhaled allergens 
in early life and the development of asthma or wheeze 
in childhood is controversial [107]. Contrarily, it has 
been hypothesized that early exposure to some aller¬ 
gens may be protective of later development of asthma 
[109], although this hypothesis was discarded in later 
studies [110]. 

Outdoor Air Pollution Air pollution, from both 
vehicular and stationary sources, is associated with an 
increase in respiratory symptoms, a decrease in lung 
function, and the exacerbation of asthma symptoms 
among asthmatic children. Recent studies have 
evidenced the relationship between traffic air pollution 
and incident development of asthma. Most of these 
effects occur at levels within the ambient air quality 
standards of most countries that have the potential to 
affect a large population of children. Air pollution 
effects contribute significantly to childrens respiratory 
morbidity and have a significant economic impact in 
terms of health expenses. Most studies attribute respi¬ 
ratory symptoms to particulate matter, although 
the close correlation of particulate matter levels with 
nitrogen dioxide and sulfur dioxide levels makes the 
contribution of individual pollutants difficult to deter¬ 
mine [5]. 

There is a consistent body of evidence that outdoor 
air pollution is associated with increased respiratory 
symptoms, such as cough, bronchitis, respiratory 
infections, and upper respiratory tract illness in chil¬ 
dren [111]. Several studies have demonstrated an 
increase in respiratory symptoms such as cough and 
bronchitis associated with increases in PM 10. A recent 
study in Eastern Germany evaluated the association 
between respiratory symptoms and air pollution levels 
before and after political reunification. A reduction in 
air pollution since reunification is associated with 
reductions in the rates of chronic cough and bronchitis 
symptoms in a new cohort of children, suggesting 
a potentially reversible effect of air pollution [112]. 


A similar dramatic effect was observed with children 
who moved out of an area to other areas of higher 
and lower PM 10 concentrations. Those children who 
moved to areas of lower PM 10 concentrations 
showed higher rates of growth in lung function, 
whereas the opposite effect was observed in children 
who moved to areas of higher PM 10 concentrations 
than before [113]. 

The role of air pollution in the exacerbation of 
asthma is well established, and there is mounting evi¬ 
dence that air pollution, particularly from traffic, is 
implicated in the pathogenesis of asthma [114]. 
Because children with asthma have increased airway 
reactivity, the effects of air pollution on the respiratory 
system can be more serious for them. Children with 
asthma have been shown to experience more respira¬ 
tory symptoms, use extra medication, produce chronic 
phlegm, and have more bronchitis following exposure 
to high levels of particulate pollution [115, 116]. 
Air pollution is also associated with increased 
school absenteeism due to respiratory illness [117], 
and increased admissions to hospital emergency 
department [118]. 

The role of the different components of air pollu¬ 
tion in the development and exacerbation of asthma is 
difficult to elucidate. Oxidant gases such as nitrogen 
oxide and ozone have been associated with asthma 
prevalence [119]. Other studies suggest that traffic pol¬ 
lution, but probably not N0 2 from traffic, is associated 
with atopy and wheezing [120]. The authors suggested 
that diesel particles or some component of those par¬ 
ticles, such as polycyclic aromatic hydrocarbons, may 
be the most important etiologic component. A recent 
review of the short-term effects of PM10 and N0 2 on 
respiratory health among children with asthma or 
asthma -like symptoms concluded there were clear 
effects of PM 10 on the occurrence of asthma symptom 
episodes, and to a lesser extent on cough and PEF 
(pulmonary expiratory function). Results with respect 
to N0 2 were inconclusive [121]. 

Of great relevance have been recent studies that 
have related proximity to heavily trafficked roads with 
asthma and reduced lung functions as well as other 
respiratory symptoms such as wheeze and dry cough. 
Lin et al. [122] found significant odds ratios for living 
within 200 m of a street with the highest tertile of traffic 
density and asthma prevalence, and the children with 
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asthma were more likely to have truck traffic on their 
street. Although some studies showed no increased risk 
[123], the weight of evidence suggests that traffic 
pollution is associated with the risk of developing 
asthma [114]. 

Lung function is also affected by air pollution. 
Acute exposure to ozone, nitrogen dioxide, sulfur diox¬ 
ide, and particulate matter is known to cause transient 
reversible decreases in lung function [124, 125]. Some 
recent studies indicate that long-term exposure to 
ozone and related co-pollutants (individually and syn- 
ergistically) is associated with impairment of lung 
function capacity in children and adolescents [126], 
although other studies are not consistent. Further, it 
has been hypothesized that a fraction of the increases in 
the prevalence of chronic obstructive lung disease in 
adults who live in more polluted areas could be the result 
of exposures that occurred during childhood [127]. 

Cancer 

Childhood cancers are relatively rare diseases during 
the first 2 decades of life, but are the leading cause of 
death of children in many countries. The most com¬ 
mon types of childhood cancer are lymphoid neo¬ 
plasms (leukemia, lymphoma) and cancers of the 
central nervous system. Other kinds of childhood 
tumors include embryonal tumors of the retina, sym¬ 
pathetic nervous system, kidney, and liver; tumors of 
bone and soft connective tissues; and certain gonadal 
neoplasms. Carcinomas in epithelial tissues, the most 
frequent type of cancer in adults, are rare in children. 
Neoplasms in adults resulting from known or iatro¬ 
genic exposure typically have latency periods of 
20 years or more. Thus, cancers in children are quali¬ 
tatively distinct from those in adults. 

Although childhood cancer is a rare disease, the 
occurrence of new cancer cases among children has 
continued to rise during the last 2 decades. While 
medical advances have led to a sharp reduction in 
mortality from childhood cancer, it is still a group of 
diseases with potentially devastating outcomes, which 
can be at least partially prevented; so the study of the 
environmental causes of cancer is essential. A recent 
report from the European Automated Childhood Can¬ 
cer Information System (EACCIS) provides evidence of 
a 1% increase per year in childhood cancers and 1.5% 


increase per year in adolescent cancer in Europe for the 
period 1970-1999. All of the common types of neo¬ 
plasms showed significant increases, including leuke¬ 
mias, lymphomas, central nervous system tumors, 
neuroblastomas, soft tissue sarcomas, retinoblastoma, 
and germ cell, renal, hepatic, and bone tumors in 
children and carcinomas, lymphomas, soft-tissue sar¬ 
comas, and germ cell and CNS tumors in adolescents. 
There were in addition significant differences between 
Eastern and Western Europe with regard to leukemias, 
lymphomas, carcinomas, and central nervous system 
tumors in children, and carcinomas, lymphomas, leu¬ 
kemias, and soft-tissue tumors in adolescents. Earlier 
reports of European populations also noted increases 
in several of these types of childhood cancers. Similarly, 
in the USA, there was an overall increase of about 25% 
in childhood cancers between 1975 and 2000. While the 
reported increases in the 1980s may have reflected 
improvements in diagnosis and reporting, it is unlikely 
that the most recently reported trends reflect the same 
bias. Evidence from both epidemiological and mutage¬ 
nicity/carcinogenicity studies suggest that environ¬ 
mental toxicants may be involved at least in the 
causation of some forms of cancers such as acute lym¬ 
phoblastic leukemia (ALL), which is most common in 
industrialized countries. 

Most cancers result from the interaction between 
genetic factors and the environment [128]. In this con¬ 
text, environment is defined broadly and includes diet, 
alcohol, drugs, tobacco, and all other nongenetic factors 
as well as classic environmental toxins. It has been esti¬ 
mated that about 80-90% of all cancers are attributable 
to environmental factors acting in conjunction with 
both genetic and acquired susceptibility. For cancer in 
adults, different opportunities for environmental expo¬ 
sure have commonly been considered a major (albeit not 
exclusive) reason for the geographical distribution of 
cancer. Geographical differences in incidence of most 
cancers (except lymphomas) are less marked for child¬ 
hood than for adult cancer, thus suggesting that the 
fraction of childhood cancers due to environmental fac¬ 
tors is probably lower than for adults [128]. 

The potentially higher susceptibility of the devel¬ 
oping fetus and children to the effect of environmental 
carcinogens is based on several aspects. Chemical car¬ 
cinogenesis is a multistep process involving genetic and 
epigenetic changes in susceptible cells that gain 
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a selective growth advantage and undergo clonal 
expansion as the result of activation of protooncogenes 
and/or inactivation of tumor-suppressor genes. The 
occurrence of these events is modulated by several 
factors that themselves change in the course of devel¬ 
opment. Thus, DNA damage may be repaired by the 
action of DNA repair enzymes, whose presence and 
activity may change with age. Similarly, changes in 
metabolism will determine whether carcinogenic 
metabolites are formed, thus allowing for the initiation 
of the cancer process. 

DNA repair mechanisms play an important role in 
cancer protection. DNA repair enzymes have been 
found to be well expressed in embryos and fetuses. In 
fact, there are a number of DNA repair genes/activities 
that have higher expression in fetuses or embryos than 
in adults. Interference with the synthesis of DNA repair 
enzymes during development may result in increased 
susceptibility to cancer during childhood or later in 
life. An example of the important role of DNA repair 
mechanisms is the p53 gene, which encodes a protein 
that modulates DNA repair and cell division. Muta¬ 
tions of the p53 genes are involved in at least 50% of all 
cancers [129]. p53 mutations have been linked with 
tobacco smoking [130], and it is possible that p53 
mutations occur in the offspring of smoking mothers. 

The higher rates of cell proliferation during develop¬ 
ment can contribute to increased likelihood of carcino¬ 
genesis. For example, PAHs and aflatoxin B1 (AFB1) 
produce liver tumors when administered to newborn 
rodents but not when administered to older animals, 
presumably because the liver proliferates rapidly in the 
developing system but more slowly in older animals 
[131]. Women who were in their teens at the time of 
atomic bombings had the greatest risk of radiation- 
induced breast cancer [132]. Some organs, such as the 
brain, are fully developed in early childhood, whereas 
others such as the skeletal system do not achieve matu¬ 
rity until adolescence [131]. This may be the reason why 
osteosarcoma, the most common bone cancer, peaks in 
late adolescence, a period of rapid bone growth. 

The immune system of the newborn is not fully 
developed until about 6 months of age. Chemicals 
that affect the immune system, such as halogenated 
aromatic hydrocarbons that bind to the Ah receptor 
[ 133] , may modify the host defense mechanism against 
infection and cancer [134]. 


The susceptibility of children to cancer may be 
influenced by the presence/absence of metabolizing 
enzymes. The role of these enzymes is potentially com¬ 
plex, especially for those that carry out the phase I 
(usually detoxifying) reactions. Usually, chemical 
metabolism protects the adult and fetus from carcino¬ 
genicity, but the activity of enzymes can also result in 
bioactivation, such as in the case of bioactivation of 
benzo[ a] pyrene to its carcinogenic form, which has 
been demonstrated by the observation of the formation 
of DNA adducts in mice and monkey fetuses, which 
probably originated in maternal liver [135]. 

The first step in metabolism is usually oxidation, 
carried out by cytochrome P450 enzymes. This group 
of enzymes are named as CYP enzymes and grouped in 
families according to their genotype and activity. 
Several groups of these enzymes are involved in the 
metabolism of exogenous carcinogens, such as the 
CYP families 1-3, which are thought to be the most 
relevant to effects of exogenous carcinogens. Other 
examples are the CYP4B1, which activates aromatic 
amines, and CYP2E1, which metabolizes nitrosamines 
and organic solvents that have been implicated in cau¬ 
sation of childhood cancers. 

Phase II detoxification enzymes have received less 
attention than CYPs in the perinatal context, though 
they are also likely to be important. For example, 
among the glutathione S-transferases (GSTs), the 
I (P) form is apparently expressed at the highest levels 
and in many tissues in human fetuses from early in 
gestation, and levels of total GST activity in all fetal 
tissues studied was comparable to that in the 
corresponding adult organ [135]. An increasing body 
of evidence implicates GST polymorphisms in risk of 
childhood leukemias. Glucuronidation, a type of 
Phase II reaction, catalyzed by uridine diphosphate- 
glucuronosyltransferase (UDG), is a key detoxification 
step for solubilization of high molecular weight carcin¬ 
ogens such as polycyclic aromatic hydrocarbons, 
heterocyclic amines, and tobacco-specific nitrosa¬ 
mines. Low expression of UDG in the fetus and neo¬ 
nate, and as a result of polymorphisms in children, 
could well increase sensitivity to carcinogenesis by 
some chemicals. Sulfotransferases (SULT) have been 
proposed to constitute a major enzymatic detoxifica¬ 
tion system in the fetus [136, 137]. However, these 
enzymes also catalyze activation of several types of 
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chemical carcinogens to DNA-damaging forms, 
including aromatic amines, polycyclic aromatic hydro¬ 
carbons, heterocyclic amines, 3-nitrobenzanthrone, 
and benzylic alcohols. This illustrates how alteration 
of enzymatic metabolism can greatly influence suscep¬ 
tibility to perinatal carcinogenesis. 

While mutagenicity/carcinogenicity studies, as well 
as the study of enzymatic systems and DNA repair 
enzymes, provide mounting evidence that the interplay 
between chemical agents and these biological factors 
are significant in the development of cancer, epidemi¬ 
ological studies have only been of limited usefulness to 
confirm this hypothesis. The increasing trends over the 
last 20-30 years and the geographical distribution of 
childhood cancers, clearly point to environmental 
causes. However, few studies have been able to firmly 
establish the environmental origin of childhood can¬ 
cers. Cumulative epidemiological evidence pertaining 
to childhood cancers, including international varia¬ 
tion, time trends, and risk factor studies was reviewed 
and analyzed by Bunin [138]. This review concluded 
that ionizing radiation and a variety of genetic condi¬ 
tions are thought to explain 5-10% of childhood can¬ 
cers. There are clear associations between Epstein-Barr 
virus infection and Burkitts lymphoma in Africa, and 
between human immunodeficiency virus and Kaposi’s 
sarcoma [138]. 

Other risk factors have not been conclusively iden¬ 
tified. Among the nongenetic causes, the pattern of 
international variation and associations with surro¬ 
gates of infection suggested an infectious etiology for 
acute lymphoblastic leukemia, although no agent has 
been identified. For brain tumors, cured meats, 
polyomaviruses, and farm exposures were pointed out 
as potential causes. Changes in the incidence and char¬ 
acteristics of children with hepatoblastoma as well as 
risk factor studies suggest a role for an exposure of very 
low birth weight babies. High birth weight, tea or coffee 
consumption, and certain paternal occupations have 
shown some consistency in their association with 
Wilms’ tumor. For most of the other cancers, very few 
epidemiologic studies have been conducted, so it is not 
surprising that nongenetic risk factors have not been 
detected. The most important difference between the 
cancers for which there are good etiologic clues and 
those for which there are not may be the number of 
relevant studies. 


This section summarizes the evidence found in the 
recent literature of the associations between environ¬ 
mental factors and childhood cancer. Attention is given 
only for those agents for which there is at least limited 
evidence of an association between environmental 
agent and disease. 

Ionizing Radiation 

Evidence of the potential of ionizing radiation to 
induce childhood cancer comes from events such as 
Hiroshima and Nagasaki, populations affected by acci¬ 
dents at nuclear plants, as well as investigations on the 
longer effects of indoor radon and on the consequences 
of exposure to x-ray for diagnostic or therapeutic 
reasons. 

Japanese children exposed to the atomic bombings 
of Hiroshima and Nagasaki had increased risks of adult 
cancers, including leukemia and solid tumors. The 
highest risk was that of children who were exposed in 
utero. Radiation-related leukemia started to occur 2-3 
years after the bombing, reached its peak within 
6-8 years, and has declined steadily since then. For 
people exposed as adults, the excess risk was lower 
than that of people exposed as children, but the excess 
risk appears to have persisted throughout the follow-up 
period [5]. 

Thyroid adult carcinomas were also increased in 
survivors who were children at the time of the bomb¬ 
ings at Hiroshima and Nagasaki. The greatest risk for 
thyroid cancer occurred in individuals who received 
a radiation dose to the thyroid greater than 1 Sv before 
age of 10 years. Similarly, breast cancer risk among 
women survivors to the atomic bombings was highest 
for the women who were less than 10 years of age or 
between 10 and 20 years of age, lower for women who 
were between 20 and 40, and even lower for women 
exposed after 40 years of age [132]. 

Prenatal diagnostic X-irradiation has also been 
linked to increased risk of leukemia in offspring, as 
has therapeutic, high-dose, ionizing radiation in child¬ 
hood for other cancers and for various non-neoplastic 
conditions [139]. 

The Chernobyl nuclear reactor accident in 1996 
caused a significant increase in the incidence of thyroid 
cancer in children since 1990. Most of the tumors have 
been observed among individuals who were very young 
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at the time of the accident [31]. In Belarus, over half of 
the tumors occurred in people who were less than 
6 years old at the time of the accident. In a series of 
472 children with thyroid cancer diagnosed up to 1995 
in Belarus, only 2% had been conceived after the acci¬ 
dent, 9% had been exposed in utero, and 88% were 
under 15 years of age at the time of diagnosis [ 140] . The 
very early age at which the thyroid cancers have begun 
to be diagnosed is one of the most striking examples 
of the special sensitivity to a carcinogen other than 
an infectious agent occurring exclusively during 
preadult life. 

UV Light 

Sun exposure is known to be a risk factor for the 
development of skin cancer later in life. The occurrence 
of sunburn is an indicator of risk [141]. Numerous 
studies have assessed the carcinogenic effect of sunburn 
at different ages and concluded exposure to solar radi¬ 
ation before 10 years of age was a primary contributor 
to risk of melanoma. The IARC concluded that child¬ 
hood is an especially vulnerable life stage [142] to the 
carcinogenic effect of UV radiation. While the use of 
sunscreen reduces the risk of sunburn, an expert work¬ 
ing group convened by the International Agency for 
Research warned against the risk of relying solely on 
sunscreens for protection from ultraviolet radiation 
[ 143] , as the use of sunscreens may lead to an extension 
of the duration of intentional sun exposure, which can 
increase the risk of melanoma. 

Environmental Tobacco Smoke 

Environmental tobacco smoke is an established human 
carcinogen by the International Agency for Research on 
Cancer [144]. Over 50 epidemiological studies have 
reported risk ratios of lung cancer for secondhand 
smoking in adults. A recent meta-analysis of epidemi¬ 
ological studies of lung cancer and adult exposure to 
environmental tobacco smoke resulted in risk ratios of 
1.22 in women and 1.36 in men from workplace expo¬ 
sure. Other meta-analyses have produced similar 
results. The evidence of a causal relationship between 
ETS exposure and cancers in organs other than the lung 
is inconclusive. 

Tobacco smoke contains many carcinogens, such as 
PAHs and 4-aminobiphenyl, that can cross the placenta 


and be transferred to the fetus. The genotoxicity of 
tobacco smoke to the fetal liver has been tested in an 
animal study. Sister chromatid exchange in the liver 
cells of fetal mice was analyzed at the 16th day of 
gestation after short-term exposure (twice, on the 
15th and 16th days of gestation), long-term exposure 
(starting 4 weeks before mating and stopping on the 
16th day of gestation), and prepregnancy exposure 
(4 weeks before mating). The number of sister chro¬ 
matid exchanges was significantly increased in all expo¬ 
sure groups, and long-term exposure caused 
a significantly higher increase than did short-term 
exposure [145]. 

Most epidemiological studies on the link between 
exposure to ETS and childhood cancer have focused 
mainly on pregnant women. While most of studies of 
smoking by mothers reveal no effects on childhood 
cancer, many studies of smoking by fathers have 
shown a significant association with risk of cancer in 
their children, and when both mothers and fathers are 
included in the study, the effect appears to be greater 
for exposure of fathers than of mothers [135]. This 
effect has been attributed to germ-cell mutations dur¬ 
ing spermatogenesis caused by tobacco products [ 146] . 
Three reports from the Oxford Survey of Childhood 
Cancers have suggested that paternal but not maternal 
cigarette smoking is associated with increased risks for 
the generality of childhood cancers. Some, however, 
have produced conflicting findings. A large case control 
study conducted in the UK (the United Kingdom 
Childhood Cancer Study) concluded that there was 
no evidence that paternal smoking is a risk factor for 
childhood cancer in general [147]. A significant asso¬ 
ciation was reported in this study, however, between 
hepatoblastoma risks and smoking by both parents 
relative to neither parent smoking. Sorahan and Lan¬ 
cashire speculated that the importance of both parents 
smoking in the etiology of hepatoblastoma might arise 
from the combination of oxidative damage to sperm 
DNA and damage to the fetal liver from carcinogenic 
metabolites in the blood of the pregnant mother [ 148] . 
Further, in a study where both preconception and 
postnatal smoking by the fathers was quantified, and 
few of the mothers smoked, the association with 
childhood cancer related significantly only to 
preconceptional paternal smoking levels [149]. Thus, 
second-hand exposure of the infants to smoke was less 
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likely. Transplacental effects of sidestream smoke were 
one possibility. In rats, sidestream smoke constituents 
received transplacentally caused oxidative DNA dam¬ 
age in fetal tissues [136]. In one epidemiological study, 
paternal smoking had a larger possible effect when 
mothers were nonsmokers [137], suggesting protective 
effects of detoxification enzymes induced in the pla¬ 
centa and in maternal tissues by maternal smoking. 

Some studies have examined the relationship 
between exposure to environmental tobacco smoke 
during childhood and cancer risk. Sandler and col¬ 
leagues [150] found that the overall cancer risk was 
greater for individuals with exposures to environmen¬ 
tal tobacco smoke during both childhood and adult¬ 
hood than for individuals with exposure during only 
one period. When specific cancer sites or types were 
considered, leukemia and lymphoma among adults 
were significantly related to exposure to maternal pas¬ 
sive smoke before 10 years of age [150]. 

Pesticides and Cancer 

Pesticides are biologically active molecules that are 
commonly used to destroy unwanted organisms in 
agricultural and residential environments. The wide¬ 
spread use of these chemicals has raised concerns over 
the potential of pesticides to cause childhood cancer. 

Although the biochemical mechanisms relating 
pesticide exposures to childhood cancer have not 
been fully described, some evidence suggests that pes¬ 
ticides may promote the formation of chromosomal 
aberrations known to be associated with an increased 
cancer risk [149]. Studies in adult populations suggest 
that pesticide exposures may have a direct effect on 
chromosome structure, and therefore a causal relation¬ 
ship between pesticide exposures and childhood cancer 
is plausible [152]. 

Epidemiological studies have reported associations 
between childhood cancer and either parental or child 
exposures to pesticides. Research reviews have 
suggested an increase in the risk of brain cancer, leuke¬ 
mia, non-Hodgkins’s lymphoma (NHO), Wilms’ 
tumor, Ewing’s sarcoma, and germ cell tumors associ¬ 
ated with parental occupational and nonoccupational 
exposure to pesticides [153, 154]. The exposures 
observed occurred prior to and during pregnancy, as 
well as after the childbirth, thus involving different 


potential modes of action. Zahm and Ward [154] con¬ 
cluded that at least some childhood cancer could 
potentially be prevented by reducing or eliminating 
pesticide exposure, although methodological limita¬ 
tions common to many studies limit the possibility of 
making conclusions regarding the role of pesticides in 
the etiology of childhood cancers [153, 154]. 

Two extensive reviews were conducted after the 
Zahm and Ward review [154] that attempted to pro¬ 
vide conclusions on the basis of cumulative evidence. 
In a review that evaluated 18 new studies conducted 
between 1998 and 2004, it was concluded that while 
collectively all studies seem to suggest an increase in the 
risk of different cancer types associated with exposure 
to pesticides, no conclusions could be drawn with 
respect to cancer types as well as to specific causative 
factors across studies [153]. Infante-Rivard and 
Weichenthal [152] reviewed studies conducted 
between 1999 and 2004 and critically evaluated the 
evidence on the associations between pesticide expo¬ 
sures and leukemia (12 studies), brain cancer (10 stud¬ 
ies), neuroblastoma (4 studies), non-Hodgkin’s 
lymphoma (3 studies), Wilm’s tumor (2 studies), and 
Ewing’s sarcoma (1 study), as in the Zahm and Ward 
review. The authors found recent studies to be consis¬ 
tent with the suggestion of Zahm and Ward of an 
association between pesticide exposure and childhood 
leukemia, brain tumors, neuroblastoma, and also non- 
Hodgkin’s lymphoma and Wilm’s tumor. Specifically 
childhood exposure to household insecticides and pre¬ 
natal exposure to pesticides seemed to pose the greatest 
risks of leukemia and brain tumors. Exposure-response 
gradients were observed in some studies on pesticide 
exposure and risk of leukemia. For neuroblastoma, the 
authors found that all four recent studies reviewed 
showed an association between exposure of pesticides 
and risk of neuroblastoma that supported earlier find¬ 
ings by Zahm and Ward. A recent study also indicates 
that residential use of pesticides, and herbicides specif¬ 
ically, may increase the risk of neuroblastoma in chil¬ 
dren [153]. The risk of non-Hodgkin’s lymphoma was 
found to be associated with residential exposure to 
pesticides, and two of the studies reviewed provided 
evidence of exposure response gradients in two of the 
three reviewed studies. With respect to Wilm’s tumor, 
recent studies did not provide additional evidence of an 
association between insecticide exposures and parental 
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pesticide exposure before birth and Wilms’ tumor that 
had been indicated in Zahm and ward’s review. The 
evidence on Ewing’s sarcoma was inconclusive. 

A systematic review and meta-analysis of studies 
on childhood leukemia and parental occupational 
exposure to pesticides concluded that there was not 
sufficient evidence to affirm an overall association 
between childhood leukemia and any paternal occu¬ 
pational pesticide exposure among all studies com¬ 
bined or subgroups of studies [155]. An elevated 
childhood leukemia risk was found in relation to pater¬ 
nal occupational exposure to the broad pesticide classes 
of insecticides and herbicides, but the authors consid¬ 
ered that the small number of studies showing this 
effect and the lack of exposure-risk relationships did 
not allow making firm conclusions. However, an asso¬ 
ciation was found between childhood leukemia and 
prenatal maternal occupational pesticide exposures. 
The studies also showed associations between child¬ 
hood leukemia and maternal occupational exposure 
to insecticides and herbicides. It was concluded that 
the overall evidence, though limited, warranted expo¬ 
sure prevention measures on the basis of the precau¬ 
tionary principle [ 152] . 

Many authors have pointed out the major short¬ 
comings of epidemiological studies on pesticides expo¬ 
sures and childhood cancer [156]. One of the major 
shortcomings is exposure assessment, as many of the 
studies use very unspecific measures of exposure, such 
as “farming” as a measure of “exposures to pesticides.” 
In most of the studies, the specific pesticide/s, timing 
and other concomitant exposures to which the popu¬ 
lation was exposed is not known. The authors 
recommended directing research efforts to the charac¬ 
terization of pesticide exposures in future research. 
Further they have remarked that “another 40 epidemi¬ 
ological studies similar to the majority of those 
conducted thus far will not provide clarity.” 

In spite of the limitations of epidemiological 
research on pesticide exposures and childhood cancers, 
many researchers find that there is sufficient evidence 
to be concerned about the potential role of pesticides in 
childhood cancers, and to recommend a precautionary 
approach. Infante-Rivard [152] contrasted the overall 
evidence against the Bradford Hill’s causality criteria 
(strength, consistency, specificity, temporality, biologi¬ 
cal gradient, plausibility, coherence, experiment and 


analogy [157]). The strongest evidence in support of 
a causal relationship between pesticide exposure and 
childhood cancer was considered to be the repeated 
detection of statistically significant increased risks 
between childhood pesticide exposure and cancer. 
The authors concluded that there is sufficient evidence 
to conclude that there is at least some association 
between pesticide exposure and childhood cancer. In 
addition, the biological gradients observed in recent 
studies also suggest that there may be a causal relation¬ 
ship between childhood insecticide exposures and the 
development of ALL and NHL [158, 159]. Infante- 
Rivard [152] indicated that the development of 
childhood cancer probably depends on the presence 
of many factors, including genetic predisposition, and 
recommended the use of improved exposure assess¬ 
ments that include separate parental interview, specific 
pesticide exposure questions, and semiquantitative 
exposure measures that can be used to confirm expo¬ 
sure information obtained through questionnaires. 

Disinfection By-products and Cancer 

Disinfection by-products (DBPs), such as trihalometh- 
anes (THMs), are regulated carcinogens in drinking 
water and have been detected in the blood and breath 
of swimmers and of nonswimmers at indoor pools. 
There are a few data on the effects of low doses on 
humans, particularly infants and children. 

Villanueva et al. [160] conducted a pooled analysis 
of six epidemiological studies and calculated a sum¬ 
mary relative risk of bladder cancer equal to 1.18 (95% 
Cl 1.06, 1.32) for exposure above 1 pg/L of trihalo- 
methanes. Boffetta [161] estimated the attributable 
fraction of bladder cancer on the basis of these figures 
to be about 10.3%. 

Arsenic 

Inorganic arsenic is a human carcinogen [162] which 
causes bladder, skin, and lung cancers in humans. At 
least in mice, inorganic arsenic is a much more potent 
carcinogen to the fetus than to adults. The modes of 
carcinogenic action of inorganic arsenic in rodents and 
in humans are not yet fully understood, but the possi¬ 
bility exists that inorganic arsenic in drinking water 
poses a special carcinogenic concern for pregnant 
women and their unborn infants. 
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Air Pollution and Childhood Cancer 

There have been several studies during the last decade 
that have linked childrens exposure to air pollutants to 
childhood cancer, leukemia in particular. Infante- 
Rivard [163] evaluated the results of epidemiological 
studies conducted between 1998 and 2008 on diverse 
chemical exposures and childhood leukemia. The review 
included nine case-control studies and four ecological 
studies. The latter showed an association between inci¬ 
dence rates of childhood leukemia and levels of air 
pollution in the area of residence of the population at 
the time of diagnosis. The lack of information about the 
place of residence of subjects prior to diagnosis (includ¬ 
ing during the preconception and gestation period) 
made it difficult to conclude an association between 
the exposure levels and risk of childhood leukemia. The 
review concluded that the weight of evidence from both 
case-control and ecological studies indicated no 
increased risk for childhood leukemia associated with 
exposure to traffic-related residential air pollution. The 
same conclusion was reached in another review 
conducted by Raaschou-Nielsen [164]. 

Endocrine Disruptors and Cancer 

Increases in the incidence of cancer in certain parts of 
the world are often cited as evidence that widespread 
exposure to endocrine disruptors has adverse effects 
on human health. Of particular concern are the 
observed increased incidences of cancer at hormon¬ 
ally sensitive sites, such as breast, uterus, prostate, 
and testis in Europe and North America. These 
increases cannot be explained only on the basis of 
improved diagnostic techniques, and it has been 
argued that these trends coincide roughly with the 
increasing use and release of industrial chemicals into 
the environment. Concerns are also based on plausi¬ 
ble mechanisms of action because both human and 
experimental animal studies show that these cancers 
are modulated hormonally. 

Many epidemiological and experimental animal 
studies have attempted to evaluate the link between 
exposure to endocrine disruptors and increased risk of 
breast cancer. A direct association between these 
chemicals and increased risk of breast cancer has not 
been established. However, some researchers claim 
that the time of life when exposure takes place 


(e.g., prenatal, neonatal, childhood, adolescence) 
may have a major influence on the appearance of this 
and other health effects. The development of the 
mammary gland occurs in multiple stages. Fetal devel¬ 
opment of the mammary gland rudiment is governed 
by tissue interactions in both males and females. In 
females, the pubertal period drives ductal morpho¬ 
genesis, and pregnancy results in massive differentia¬ 
tion of the mammary gland. Thus, the perinatal 
period and the period between age at menarche and 
age at first full-term pregnancy may be particularly 
important for breast tumor development and latency 
[165]. Young girls exposed to carcinogenic agents dur¬ 
ing puberty may be at high risk of future breast cancer 
due to susceptibility of rapidly growing breast tissue 
mediated by hormonal changes during this time. This 
claim is supported by data from atomic bomb survi¬ 
vors, where an increased risk of breast cancer was found 
in women exposed before 20 years of age [132]. Simi¬ 
larly, an elevated risk was found for women irradiated 
during childhood for medical reasons [166]. 

DES was extensively prescribed in developed coun¬ 
tries from the late 1940s through the 1970s to women 
with high-risk pregnancies to prevent miscarriages and 
other complications of pregnancy. In the early 1970s, 
a rare form of female reproductive tract cancer, clear 
cell adenocarcinoma began to be detected among 
women whose mothers had taken DES during preg¬ 
nancy [167]. Although clear cell adenocarcinoma 
occurs in only 0.1% of women who were exposed to 
DES in utero, this represents a 40-fold increased risk in 
comparison with the nonexposed population. In con¬ 
trast, men who were exposed to DES in utero do not 
have a clear increased risk of any cancer, although 
a statistically nonsignificant threefold increased risk of 
testicular cancer has been reported [ 168] . While there is 
no epidemiological evidence for a link between expo¬ 
sure to estrogenic or anti-androgenic compounds and 
testicular cancer, some authors have hypothesized that 
the similar increases in the incidence of testicular 
cancer and in the incidence of cryptorchidism and 
hypospadias in similar geographical areas, suggest 
a common cause of similar environmental origin in 
both health effects [85]. 

Some studies have shown that women taking DES 
during pregnancy to prevent miscarriage have been 
shown to have a slightly increased risk of developing 



Environmental Toxicology: Children at Risk 


3649 



breast cancer 30 years after taking the drug [169]. 
Data on the risks of breast cancer in daughters of 
DES-exposed women are not yet available. 

Future Directions 

During the last 2 decades, there has been an exponential 
increase of scientific literature about the susceptibility 
of children to the effects of environmental exposures. 
Evidence is accumulating that toxic chemicals are 
responsible for at least some of the changing patterns 
of disease. Well-studied examples include adenocarci¬ 
noma of the vagina in girls exposed prenatally to DES; 
asthma and pneumonia caused by smoke and particu¬ 
late air pollutants; neurodevelopmental toxicity in 
infants exposed to lead, PCBs, methylmercury; and 
small head circumference at birth in infants exposed 
in utero to organophosphate pesticides. 

Past discoveries of etiologic associations between 
toxic environmental exposures and diseases in children 
have led to successful programs of exposure control 
and disease prevention. Examples include reductions 
in the use of alcohol and tobacco during pregnancy, 
minimization during pregnancy of diagnostic x-rays, 
and removal of lead from gasoline. Sadly, though, the 
interval between initial recognition or suspicion of 
effects and their eventual control has been typically 
been far too long. Early warnings have frequently 
been ignored. The price of delayed action has been 
widespread increases in the incidence of certain dis¬ 
eases such as asthma and cancer. 

When evaluating the health and social impact of 
environmental threats on children, it is necessary to 
take into account, not only their effects on childhood 
health, but also the long-term potential health effects 
throughout the lifelong span of the individual. It has 
been hypothesized that early exposure to environmen¬ 
tal toxicants could affect the brain later in life. Consen¬ 
sus among scientists is based on experimental studies 
on associations between early life exposures to pesti¬ 
cides and Parkinson s disease, as well as on epidemio¬ 
logic studies of the toxic and apparently irreversible 
effects on the developing brain of in utero exposure 
to lead, methylmercury, and polychlorinated biphenyls. 
A mechanistic hypothesis proposed that early exposure 
to neurotoxic chemicals reduce the number of neurons 
in critical areas of the brain such as the substantia nigra 


to levels below those needed to sustain function in the 
face of neuronal attrition associated with advancing 
age. In addition, as some researchers have pointed 
out, the effects at population level would add 
a substantial burden to society, in terms of health, 
economic, and human costs. 

The difficulties in conducting environmental health 
research have been pointed out by many authors dur¬ 
ing the last 3 decades. Human populations are exposed 
to hundreds of chemicals through air, water, and food, 
under many different exposure situations. Environ¬ 
mental exposures are usually low, and often occur 
concomitantly with occupational exposures, smoking 
or naturally occurring agents. And when an association 
between with an exposure and a disease in found, such 
as for example, air pollution and asthma, a long path 
lies ahead to determine the specific causative agent of 
the disease, and the population and individual suscep¬ 
tibility factors that makes some individuals and not 
others become ill. The potential for interaction of dif¬ 
ferent pollutant exposures makes the question with 
respect to the role of environmental exposures impact 
on health even more difficult to answer. The need for 
taking into account multiple and cumulative exposures 
has been clearly affirmed by the EPA and other govern¬ 
ment and research bodies. 

Slowly, evidence is found to confirm or discard 
certain effects, but it seems that progress is too slow to 
catch up to the increases in incidence of certain dis¬ 
eases, such as asthma and cancer. It is possible that the 
statement of Olshan and Daniels [170] with respect to 
pesticide research, that “another 40 epidemiological 
studies similar to the majority of those conducted 
thus far will not provide clarity,” should be applied to 
other groups of chemicals, such as disinfection by¬ 
products, or endocrine disruptors, and that we need 
to think of a more effective way of examining evidence. 
Many authors have pointed out the need of better 
exposure assessment methods. In addition to continu¬ 
ing toxicological research, which is essential to eluci¬ 
date the mechanisms of action, epidemiological 
research should probably incorporate exposure assess¬ 
ment methods that could enable detection of at least 
the specific exposures and exposure levels. Exposure 
assessment can be improved in many ways. In the 
extreme, if the body burden of a list of suspected 
carcinogens in the blood or adipose tissue of each 
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child diagnosed with cancer were determined, more 
specific associations between disease and toxic agent 
may be determined. The path to confirm and rule out 
some proposed environmental causes would probably 
be at least somewhat shorter. While this approach may 
be costly, it may be cost-efficient in the long run. There 
have been some efforts to integrate an environmental 
health perspective in the medical practice. These efforts 
should be further developed and supported. 
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Glossary 

Antioxidant A term loosely defined as something that 
stops oxidation when present at a low amount. The 
term is used most specifically to mean a free radical¬ 
scavenging chemical that accepts or donates an 
electron to a radical to terminate radical chain 
reactions. The term is also used for agents that 
inactivate radical initiation and catalysis, such as 
metal ion chelators, agents that block radiation- 
induced oxidation, and agents that counter oxida¬ 
tion of a substance with reduction. 

Disulfide An intermediate oxidation state of sulfur 
that is readily interconverted with the reduced 
(thiol) state and commonly found in biomolecules. 
Disulfides are important in protein structure, 
function, regulation, and translocation. Disulfide 
formation in proteins and low molecular mass 
sulfur-containing chemicals (glutathione, cysteine, 
Co-enzyme A) is a common and sensitive indicator 
of oxidative stress. 

Electrophile (or electrophilic chemical) A chemical 
with a functional group that is deficient in electron 
density, such as a quinone or conjugated aldehyde, 
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and therefore reactive with a nucleophilic chemical 
containing electron-rich functional groups, such as 
a thiol. Electrophiles are important in toxicology 
because they react with nucleophilic centers in 
DNA, causing mutations and cancer, and with pro¬ 
teins, disrupting enzyme, receptor, transporter, 
gene expression, and other critical functions. 

Lipid peroxidation A radical chain reaction involving 
polyunsaturated fatty acids (PUFA) and 0 2 . The 
process is initiated by agents that abstract 
a hydrogen atom from the carbon skeleton to 
form a carbon-centered radical, propagated by 
addition of 0 2 and abstraction of a hydrogen 
atom from another carbon skeleton with creation 
of a lipid hydroperoxide and another radical, and 
ultimately terminated by reaction of radicals with 
each other to create non-radical species. The pro¬ 
cess causes fats and oils to become rancid and is 
common in decaying organic matter. 

Oxidative stress A process or outcome in which an 
imbalance in prooxidant and antioxidant reactions 
causes macromolecular damage and/or disruption 
of biologic redox signaling and control. All classes 
of macromolecules and any biologic function can 
be affected. Effects on DNA are important in muta¬ 
genesis and cancer, effects on proteins contribute to 
many acute and chronic toxicities, effects on lipids 
contribute to dysfunctional energy metabolism and 
immunity, and effects on carbohydrates affect bio¬ 
logical lubricants in joints and cellular identity. 

Prooxidant An agent that stimulates aberrant electron 
transfer in biologic systems and causes oxidative 
stress. This includes agents that initiate radical reac¬ 
tions, oxidize biologic components, or interfere 
with normal reductive and antioxidant functions. 

Radical (or free radical) An organic molecule with 
an unpaired electron. Radicals are important in 
biologic systems because they are reactive and can 
disrupt biologic functions. Radicals participate in 
a unique type of chemical reaction sequence termed 
a chain reaction. A radical chain reaction, such as 
that illustrated by lipid peroxidation, is initiated by 
an agent that removes (or adds) an electron from 
a non-radical chemical to form a radical. The chem¬ 
ical structure is modified, and the radical product 
becomes a non-radical by accepting or donating an 
electron to another non-radical species, thereby 


propagating the reaction sequence. The process is 
ultimately terminated by reaction of radicals with 
each other to create non-radical species. 

Reactive oxygen species (ROS), sometimes reactive 
oxygen intermediates (ROI) One or more 
oxygen-containing chemicals that are reactive with 
organic molecules. This term is often used to refer 
to superoxide anion radical, hydrogen peroxide, 
lipid hydroperoxide, and related oxygen-centered 
radicals, but may also include reactive nitrogen 
species (RNS), such as nitric oxide, peroxynitrite, 
and other oxides of nitrogen (NOx), which also 
contain oxygen. The term is most often used 
because the chemistry is too complex and/or the 
analytic methods are insufficient to identify specific 
reactive species. 

Redox cycling A process in which a chemical accepts 
an electron from a biological reductant and trans¬ 
fers that electron to 0 2 or other acceptor to create 
aberrant radical generation. Many quinones and 
other aromatic chemicals do this by interacting 
with flavoproteins. The process is considered 
“cycling” because the chemical functions only as 
a catalyst. After donating the electron to an accep¬ 
tor, the original form is regenerated so that it can 
accept another electron. If electrons are transferred 
between redox-signaling pathways, this creates 
a “short circuit,” disrupting cell regulation. 

Redox signaling A mechanism for cellular communi¬ 
cation involving an oxidation-reduction reaction. 
This process commonly involves generation of 
a small diffusible redox-active chemical that trans¬ 
fers a signal to a biologic receptor. Redox signaling 
is highly integrated with kinase signaling and other 
central cell communication mechanisms, and diffi¬ 
cult to discriminate from a more general process of 
redox sensing, which coordinates cellular functions 
via widely distributed redox-sensitive cysteine resi¬ 
dues in proteins. 

Thiol An organic form of sulfur containing a sulfur- 
hydrogen covalent bond. This is the reduced form 
of sulfur in the amino acid cysteine, in the antiox¬ 
idant glutathione, and in most cysteine residues in 
cellular proteins. In proteins, thiols are also termed 
“sulfhydryl groups.” The thiol functional group 
supports diverse reactivity and catalytic functions 
of proteins. Interconversion with disulfides and 
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other forms allows thiols to serve as structural 
transducers as well as chemical signal transducers. 

Xenobiotic A chemical that is foreign to an organism. 
Although often used to refer to man-made 
chemicals, the term is also used to discriminate 
chemicals derived exogenously from the environ¬ 
ment from those that are generated endogenously 
as a consequence of intermediary metabolism. 

Definition of the Subject 

Oxidative stress is a process or outcome in which an 

imbalance in prooxidant and antioxidant reactions 


causes macromolecular damage and/or disruption 
of biologic redox signaling and control (Fig. 1) [1]. 
Oxidation-reduction reactions, or “redox” reactions, 
are reactions involving electron transfer. These reac¬ 
tions are central to energy metabolism and mainte¬ 
nance of metabolic and physical organization of living 
organisms. Environmental agents that interfere with 
redox reactions or promote abnormal redox reactions 
are highly destructive to biologic macromolecules 
and metabolic organization. Electron transfer reactions 
in the atmosphere, water, and soil are common and 
important subjects of environmental sciences. In many 
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The spectrum of oxidative stress 

Macromolecular damage and 

Imbalance in proxidant disruption of complex 

and antioxidant processes biological redox networks 


Physical forces 

Visible and ultraviolet light 
Ionizing radiation 
Sound 
Heat 
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reductants 

Distribution of metal ions 
Nitrogen oxides 
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Organic oxidants and reductants 

Persistant organic pollutants 
(POPs) 

Polycyclic aromatic hydrocarbons 
Cigarette smoke 

Halogenated hydrocarbons 


Excess oxidants 

Exposure to chemicals with 
unpaired electrons (radicals) 

Endogenous generation of radicals; 
lipid peroxidation 

Excess generation of non-radical 
oxidants 


Depletion of protective chemicals 

Photoprotective agents 
Radical scavengers 


Failure of biological protective 
mechanisms 

Altered diet, obesity 

Health behavior, lack of 
exercise 


Destruction of macromolecules: 

DNA, mutations and cancer; 
carbohydrates, cell identity; lipids, 
endogenous generation of reactive 
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Environmental Toxicology: Oxidative Stress. Figure 1 

The spectrum of oxidative stress in environmental toxicology. Oxidative stress refers to adverse effects [right) as well as the 
processes [center) that cause adverse effects in biologic systems. Environmental factors that contribute to oxidative stress 
[left) include physical, inorganic, and organic agents. These agents create imbalances in specific prooxidant and 
antioxidant reactions or reaction pathways [center) due to excess oxidants, depletion of protective chemicals, or failure of 
endogenous protective mechanisms. The imbalances result in the generation of reactive radicals and non-radical oxidants 
that cause macromolecular damage, inhibit thiol-dependent proteins, short-circuit redox pathways, and disrupt 
regulatory mechanisms [right) 
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cases, physical processes and inorganic chemicals are 
relevant causes of oxidative stress to living organisms. 
Oxidative stress can also occur endogenously within an 
organism due to organic chemicals derived from the 
environment. This entry focuses on oxidative stress 
within living organisms, recognizing that the environ¬ 
mental exposures causing oxidative stress in an organ¬ 
ism are highly variable (Fig. 1, left). Many of these are 
addressed in detail in other sections of this 
Encyclopedia. 

Living organisms depend upon the capability to 
maintain chemical and physical homeostasis despite 
variations in environmental exposures. Exposures that 
cause oxidative stress include chemical oxidants such as 
ozone and nitrogen oxides of the atmosphere [2, 3], 
transition metals such as iron [4] and cadmium [5-7], 
visible and ultraviolet radiation from the sun [8, 9], 
ionizing radiation from radioactive decay [10, 11], and 
a large number of pharmaceuticals [12] and chemicals 
from commercial products and industrial wastes 
[13, 14]. In some cases there is no real change in the 
amount of environmental toxicant, only a change in 
distribution that affects exposure of living organisms 
[15]. In other cases, there is an accumulation of envi¬ 
ronmental toxicant due to human activity. Some of 
these are recognized as persistent organic pollutants 
(POPs) [16-18] and other times they are persistent 
products of organic pollutants (P-POPs), some of 
which are not currently known to accumulate or rep¬ 
resent a hazard. For instance, the recent investigation of 
PFOA, a product of fluorocarbons that is of unknown 
origin [19, 20], suggests that other unidentified 
chemicals derived from human activity may exist. Bio¬ 
logic defense mechanisms are diverse and include phys¬ 
ical barriers, chemical detoxification mechanisms, 
repair mechanisms, and avoidance behaviors. Consid¬ 
erable scientific data are available in each of these areas, 
and efforts are currently being initiated to assemble this 
knowledge in a systematic description of the 
“exposome,” i.e., the summation of all environmental 
exposures from conception throughout the life of an 
individual [21, 22]. 

The concepts of oxidative stress are currently 
undergoing revision from the earlier definition as an 
imbalance of prooxidants and antioxidants that cause 
macromolecular damage [23] to one that also includes 


oxidative stress as a disruption of vital signaling and 
control processes within a biologic system [1, 24]. 
In human health, the accumulating data suggest 
that the latter is more relevant. However, in environ¬ 
mental toxicology, both macromolecular damage and 
disruption of redox signaling and control mechanisms 
are relevant. Importantly, new developments in 
systems biology [25, 26], including powerful new 
molecular approaches and analytic tools to study 
genomics [27] and epigenomics [28], gene expression 
(transcriptomics) [29], protein modifications (proteo- 
mics) [30, 31] and chemical profiles (metabolomics) 
[32, 33] provide capabilities to improve surveillance, 
mechanistic understanding, and risk reduction. 

Introduction 

Two-electron and one-electron transfers in biologic sys¬ 
tems. Living organisms obtain the energy necessary to 
sustain organization, delineation from their external 
environment, and reproduction, through electron 
transfer reactions. Loss of electrons is termed “oxida¬ 
tion,” while gain of electrons is termed “reduction” 
(Fig. 2a-c). Because of the conservation of matter in 
chemical reactions, the processes are always coupled, so 
the terms “oxidation-reduction” reactions and “redox” 
(short for “reduction-oxidation”) reactions are usually 
more correct. However, the common process of oxida¬ 
tion of chemicals in the presence of atmospheric 0 2 
(with water as the reduction product) explains the bias 
in terminology. Excessive reduction can also occur, 
termed “reductive stress” [34, 35], and is especially 
relevant to hypoxic and anoxic environmental 
conditions. 

The major elements of life, carbon, hydrogen, oxy¬ 
gen, nitrogen, phosphorus, and sulfur form multiple 
stable biochemical structures in which pairs of elec¬ 
trons are shared. The major macromolecules (protein, 
nucleic acids, lipids, carbohydrates) are formed from 
smaller biochemicals (building blocks: e.g., amino 
acids, nucleotides, fatty acids, sugars) that include 
these elements. The biochemical structures are 
interconverted through chemical reactions that involve 
conservation of these pairs of electrons. The intercon¬ 
versions are often slow without a catalyst. Biologically, 
this involves two-electron transfers by NADH or 
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Environmental Toxicology: Oxidative Stress. Figure 2 

Oxidation-reduction reactions, (a) One-electron transfer reaction from superoxide anion radical (0 2 - *) to Fe 3+ , as occurs in 
transfer from superoxide to oxidized cytochrome c. In this reaction, superoxide anion radical is oxidized to molecular 
oxygen (0 2 ), while the iron is reduced from Fe 3+ to Fe 2+ . Although the reaction can occur in the opposite direction, such as 
the generation of superoxide by electron transfer in oxy-hemoglobin, energetics in biologic systems usually favor the 
forward direction as written, (b) One-electron transfer from Fe 2+ to hydrogen peroxide (H 2 0 2 ). This energetically favorable 
reaction generates three products, Fe 3+ , hydroxyl radical (*OH), and water. The hydroxyl radical is extremely reactive and 
destructive to biochemicals. Combination of reaction A and B constitutes the iron-catalyzed Haber-Weiss reaction, also 
known as the Fenton reaction, which has a central role in radical-mediated macromolecular damage. This process is very 
destructive when appropriate concentrations of superoxide, H 2 0 2 , and iron are present, (c) Flavin-containing enzymes, 
such as NADPH oxidases, catalyze both 1 -e“ and 2-e _ transfers to 0 2 , thereby forming both superoxide and H 2 0 2 . Aerobic 
cells contain NADPH, 0 2 , and flavoproteins so that conditions are poised to support Fenton chemistry (A + B) when these 
enzymes are active. The most potent NADPH oxidase, Nox2, is present in phagocytic cells and activated to kill the invading 
microorganisms, (d) Two-electron transfer, shown for the detoxification enzyme NADPHrquinone reductase-1 (NQOI), is 
used to support chemical interconversions in intermediary metabolism. The niacinamide group shown for NADPH is 
identical to that in NADH, and serves to transfer a pair of electrons as a hydride ion (H“) without radical formation. The 
reaction shown is an important detoxification reaction by which common environmental quinones are reduced to quinols 
for phase 2 conjugation and excretion. Quinones can function either as oxidants or electrophiles, and can also generate 
radicals through redox cycling (see Fig. 3). These adverse reactions are prevented by the NQOI reaction, (e) Thiols are used 
as reductants to eliminate H 2 0 2 . In the glutathione (GSH) peroxidase reaction shown, two molecules of GSH are used to 
reduce H 2 0 2 to water without forming radicals as intermediates. Reduction of the oxidized product, GSSG, by an NADPH- 
dependent reductase (not shown) completes a detoxification cycle for peroxides. This reaction cycle is complemented by 
a thioredoxin/peroxiredoxin system to maintain peroxide concentrations very low in cells despite continuous generation. 
In some cases thiol radicals are formed, but these rapidly react to form disulfides, thereby minimizing toxic effects 
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NADPH, with the electrons being transferred as 
a hydride (H - ) ion (Fig. 2c-e). Enzymes serve as cata¬ 
lysts to enhance rates of reactions so that the presence 
of enzymes largely determines the biochemical path¬ 
ways for interconversion of the carbon-containing bio¬ 
chemicals. Many of these interconversions involve 
reaction of “electrophiles,” chemicals with orbitals defi¬ 
cient in electrons, and “nucleophiles,” chemicals with 
orbitals with excess electron density. 

The two-electron (2-e - ) transfer reactions are 
contrasted by one-electron (l-e _ ) transfers, which are 
predominant reactions of many biologically important 
metals, such as iron and copper, environmentally 
important metals, such as cadmium, chromium, and 
tin, and atmospheric reactions driven by ionizing and 
non-ionizing radiation. Living organisms are depen¬ 
dent upon l-e _ chemistry because the central energy 
capture and transfer reactions of photosynthesis and 
mitochondrial respiration involve l-e _ transfers. How¬ 
ever, these l-e“ transfers are physically contained 
within subcellular structures (chloroplasts and mito¬ 
chondria), so that both l-e“ and 2-e“ chemistries can 
occur simultaneously and be used to provide critical 
reaction pathways in biological systems. The incom¬ 
patibility is illustrated by the combination of reactions 
A and B in Fig. 2, where superoxide donates an electron 
to Fe 3+ to form Fe 2+ , and Fe 2+ reacts with hydrogen 
peroxide to form the highly reactive and destructive 
hydroxyl radical. The l-e _ systems of mitochondria 
and chloroplasts are important in environmental toxi¬ 
cology because they represent a loaded gun; these sys¬ 
tems have very high electron transfer rates, are sensitive 
to visible and ultraviolet light, trace metals, oxidants 
and electrophiles, and cause extensive destruction to 
other biologic components when triggered by environ¬ 
mental toxicants. 

The 2-e _ and l-e _ pathways are connected through 
flavoproteins and hydroquinone/quinone structures 
that exist in interconvertible fully oxidized, l-e~- 
reduced and 2-e _ -reduced forms. Flavoproteins are 
widely used in biology to support 2-e _ transfer reac¬ 
tions involving NADH and NADPH. In addition, they 
are critical to connect 2-e _ pathways to l-e _ pathways, 
accepting 2 e _ from NADH or NADPH and transfer¬ 
ring 1 e _ to electron acceptors. An illustration is given 
in Fig. 2c, where different NADPH oxidases support 
both 1- and 2-e~ reduction of 0 2 . Flavoproteins 


catalyze l-e _ transfer to the mitochondrial respiratory 
chain, the major enzyme systems for oxidation of xeno- 
biotics, and to 0 2 to form ROS for signaling and 
defense against invading microorganisms. They also 
are a major source of toxic oxidant generation by ini¬ 
tiating lipid peroxidation (Fig. 3). The planar aromatic 
structure interacts with polycyclic aromatic structures, 
facilitating the generation of reactive chemicals with an 
unpaired electron, termed “radicals,” in a process 
known as redox cycling [36]. Flavins and flavoproteins 
are also photoreactive because they absorb visible light, 
and this activates electrons for transfer to other 
chemicals. 

Antioxidants include chemicals that are relatively 
stable as radicals and donate or accept an electron to 
reactive radicals to decrease their destructive chemistry. 
This is highly beneficial by preventing radical-mediated 
destruction of macromolecules. Vitamin C (ascorbic 
acid) and vitamin E (a-tocopherol) are radical¬ 
scavenging antioxidants and function to terminate 
free radical chain reactions (See Fig. 3). Vitamin C is 
water soluble and synthesized by most animals but not 
by humans. Vitamin E is fat soluble and one of several 
lipophilic chemicals that blocks radical reactions in 
biologic systems. Plants are rich in radical-scavenging 
antioxidants, and also in carotenoids, light-absorbing 
chemicals that protect against light-induced oxidative 
damage. 

Evolution of "Free Radical" and "Radical-Free" 
Concepts of Oxidative Stress 

Joseph Priestley, an eighteenth century theologian and 
chemist credited with the first publication demonstrating 
production of 0 2 [37], intimated that “dephlogisticated 
air” was toxic even before Lavoisier demonstrated 
that “dephlogisticated air” was oxygen [38]. Priestley 
said of breathing dephlogisticated air “.. .1 fancied that 
my breast felt peculiarly light and easy for some 
time...” and “a candle burns out much faster in 
dephlogisticated air than in common air, so we might, 
as may be said, live out too fast, and the animal powers 
be too soon exhausted in the pure kind of air. 
A moralist, at least, may say, that the air which nature 
has provided for us is as good as we deserve” [39]. 

This conclusion of the late eighteenth century pre¬ 
saged a cliche of late twentieth century science, that 
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Environmental Toxicology: Oxidative Stress. Figure 3 

Lipid peroxidation is a common radical chain reaction that occurs in biologic systems. Radical chain reactions include 
initiation, propagation, and termination events. In biologic systems, initiation ( top left) often occurs by redox cycling 
of quinone compounds and reductive activation of halogenated hydrocarbons (e.g., CCI 4 ). In redox cycling, quinones 
accept an electron from a reduced flavoprotein, forming a radical. The radical can donate or accept an electron 
through a hydrogen abstraction from a polyunsaturated fatty acid (PUFA) ( top middle). Normally this is most 
energetically favorable at a carbon adjacent to carbons with double bonds. The resulting radical rearranges to form 
a conjugated double bond system with an adjacent carbon-centered radical. Under aerobic conditions, 0 2 rapidly adds to 
form a peroxyl radical (center). The peroxyl radical reacts with another PUFA (right middle) to propagate the free radical 
chain reaction. In this reaction, the peroxyl radical is reduced to a lipid hydroperoxide, which is normally reduced by 
GSH peroxidases or peroxiredoxin-6 (not shown). When activities of the latter systems are impaired and free iron is 
present, the process can amplify lipid peroxidation by Fe 2+ -dependent initiation of another radical chain reaction 
(bottom left). In the presence of efficient radical scavengers, such as vitamin C and vitamin E, the chain reactions are 
terminated. Note that vitamins C and E are most active with different types of radicals and thus complement each other 
in function. Free radical chain reactions are limited in normal living tissues due to the presence of antioxidants and very 
high protein concentration. Lipid peroxidation in oils, fats, and decaying matter is very active when antioxidants are 
depleted 
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“oxygen is a double-edged sword” While science is 
usually better than cliche, the cliche was highly effective 
in promoting science, and in this case, an almost reli¬ 
gious fervor to introduce “antioxidants” in dietary, 
cosmetic, and other commercial product development. 
Failure of large-scale, double-blind interventional trials 
with antioxidants in humans [40-47] has led to 
a backlash [48]. Some scientists now have a dismissive 
attitude concerning the importance of oxidative stress. 
However, the scientific knowledge of the mechanisms 
of oxidative stress is very strong and clearly establishes 
oxidative stress as relevant to environmental toxicol¬ 
ogy. A brief look at the evolution of concepts facilitates 
resolution of apparent conflicts between the extensive 
biomedical literature on oxidative stress and implica¬ 
tions for environmental health. 

Catalase was one of the first enzymes to be studied 
[49], and its association with aerobic life was recog¬ 
nized by early biochemists [50]. Hydrogen peroxide 
(H 2 0 2 ) was recognized as a toxic reactive oxygen spe¬ 
cies (ROS) associated with 0 2 exposure [51]. Early 
studies also included l-e“-transfer reactions of the 
mitochondrial electron transfer pathway, begun by 
Keilin in the 1920s [52]. During this time, the environ¬ 
mental agent paraquat (methyl viologen) was intro¬ 
duced as an “electron mediator” to transfer electrons 
into the mitochondrial pathway [53]. The concepts of 
oxidative stress, though, are perhaps better traced to 
the concept of “free radicals,” organic molecules with 
an unpaired electron, and their damaging effects on 
biological systems. The chemical terminology for 
a free radical has been changed to “radical,” but the 
former is still more commonly used. 

The study of radicals in biologic systems was popu¬ 
larized by the discovery of lipid peroxidation (Fig. 3) and 
antioxidant actions of vitamin E [54] and selenium- 
dependent glutathione peroxidase [55, 56]. Additional 
toxicologic studies of carbon tetrachloride (Fig. 3) 
showed that radical reactions of lipid peroxidation 
were responsible for hepatotoxicity [57, 58]. Finally, 
Denham Harmon attracted attention through his 
hypothesis that free radicals contributed to aging [59]. 

Lipid peroxidation is a free radical chain reaction 
contributing to rancidity of oils and fats and invariably 
present in dead and decaying matter [60]. Radical 
chain reactions involve an initiation event in which 
a radical is formed, subsequent propagation events, 


and finally termination reactions that eliminate the 
radicals (Fig. 3). Chemicals, such as quinones, that 
interact with flavoproteins and redox cycle [36] are 
a common source of initiation (Fig. 3, top left). 
Another process involves activation of halogenated 
hydrocarbons and other environmental chemicals to 
free radicals by the hemoproteins of the cytochrome 
P450 (CYP) family (Fig. 3, top left). These radicals 
abstract a hydrogen atom from polyunsaturated fatty 
acids (PUFA; Fig. 3, top center), which result in the 
rearrangement of the newly formed fatty acid radical to 
a more stable form with conjugated double bonds 
(Fig. 3, center). 0 2 readily adds to this intermediate 
to form a peroxyl radical, which then propagates the 
chain reaction by the abstraction of a hydrogen atom 
from another PUFA (Fig. 3, right center). In the pres¬ 
ence of trace Fe 2+ , the process can be amplified by the 
creation of a new radical from the lipid hydroperoxide 
(Fig. 3, lower left). Vitamins C and E terminate the 
process by reducing the reactive intermediates and 
breaking the chain reaction. 

While this constellation of findings and conceptual 
developments advanced the biochemical study of free 
radicals, the unanticipated finding of an enzyme that 
detoxifies superoxide attracted the attention of the 
broad spectrum of respiratory biologists and invigo¬ 
rated a new field of free radical biology and medicine. 
Study of the 0 2 -dependent reduction of cytochrome 
c by xanthine oxidase led to the discovery that the 
abundant blood protein erythrocuprein was an enzyme 
that converted the l-e _ reduction product of 0 2 , i.e., 
the superoxide anion radical (0 2 ~') to 0 2 plus H 2 0 2 
[61]. The discovery of this enzyme, superoxide 
dismutase [62], was seminal because it provided evi¬ 
dence that biological systems had evolved an enzyme to 
eliminate radicals. This provided clear evidence that 
radicals must be commonly produced and be a threat 
to biological systems. 

Chance et al. [63] provided an authoritative review 
on hydroperoxides that served as a platform for the 
integration of concepts of biologic damage by l-e _ and 
2-e _ oxidative processes by Sies [23] in the first mono¬ 
graph entitled “Oxidative Stress.” In this volume, Sies 
[23] defined oxidative stress as “an imbalance of 
prooxidants and antioxidants in which the former pre¬ 
dominates and causes macromolecular damage.” This 
clear and simple definition proved useful to stimulate 
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research on oxidative stress as a mechanism of disease 
and aging, and antioxidants as means to protect against 
these deleterious processes. 

The contemporary definition of oxidative stress has 
been refined based upon research findings of the turn 
of the century [1, 24, 64, 65]. Discovery of a family of 
enzymes, NADPH oxidases, that generates 0 2 ~ and 
H 2 0 2 as signaling molecules [66, 67] showed that one 
cannot simply equate ROS with harmful chemistry, i.e., 
oxidative stress. 

At the same time, results of large-scale, double¬ 
blind interventional studies with radical-scavenging 
antioxidants began to appear in the literature 
[40-47]. These studies showed that supplementation 
with antioxidants, i.e., shifting the “balance” toward 
protection, had little or no health benefit in humans. 
Consequently, the original definition of oxidative stress 
needs to be qualified in the sense that an “imbalance” is 
only considered in terms of specific reactions or path¬ 
ways and not in terms of a global imbalance. In other 
words, an imbalance in a specific reaction or pathway 
can cause damage without an overall imbalance in the 
system, and a change in overall balance of prooxidants 
and antioxidants can occur without causing damage. 

The contemporary view of oxidative stress has also 
been impacted by the development of new -omic tech¬ 
nologies and systems biology. Instead of being limited 
to overall measures such as cell death [68], lipid perox¬ 
idation [69], or protein carbonylation [70], one can 
now measure effects on expression of specific genes 
[29], modification of specific lipids [71], and oxidation 
of specific proteins [72]. This research now shows that 
oxidative stress as it applies to environmental toxicol¬ 
ogy involves two processes: one related to earlier con¬ 
cepts of radical-mediated macromolecular damage and 
the other related to disruption of redox signaling and 
control mechanisms, which are largely non-radical in 
nature. 

Radical Mechanisms of Environmental Toxicity 

The lipid peroxidation mechanisms described above 
(Fig. 3) are more complex in living organisms because 
of the high concentrations of protein, the abundance of 
free radical scavenging antioxidants, and the processes 
of biologic turnover, which eliminate and replace dam¬ 
aged macromolecules and cells. In addition to redox 


cycling agents and halogenated hydrocarbons, 
a number of other initiators occur, including transition 
metals, ionizing and non-ionizing radiation, and 
sound. Because of the high concentration of protein 
in cells, the number of propagation reactions tends to 
be small, emphasizing the importance of initiation 
events in toxicity. 

In pure lipid systems, free radical chain reactions 
can occur in which an initiation event causes modifi¬ 
cation of 200-400 molecules of fatty acid prior to being 
terminated by radicals reacting with each other. How¬ 
ever, in biologic systems, such a chain reaction does not 
occur for several reasons. One is that initiation events 
are prevented by keeping free metal ion concentrations 
very low, and blocking other common initiation events. 
Another is that the concentration of protein is so high 
that H- is often abstracted from proteins rather than 
other polyunsaturated lipids, thereby decreasing prop¬ 
agation. Biologic systems also contain high concentra¬ 
tions of radical scavengers, such as vitamin C and 
vitamin E, which function as chain-terminating anti¬ 
oxidants. Amplification is also effectively prevented by 
the removal of lipid hydroperoxides by the peroxidases. 

The propagation reactions are also not simple 
because multiple rearrangement products can occur. 
In addition to the rearrangement shown in Fig. 3, 
PUFA radicals can rearrange to eliminate 
a conjugated aldehyde, such as 4-hydroxynonenal 
(HNE). HNE is a reactive conjugated aldehyde that 
can react with proteins or DNA. The reactions gener¬ 
ating HNE also generate carbon-centered radicals that 
propagate chain reactions. A number of other products 
can be formed, including epoxides (Fig. 4), which 
can be important as more stable forms with greater 
specificity in reaction with macromolecules, and 
isoprostanes [73], rearrangement products that are 
useful as biomarkers of this chemical reaction 
sequence. Conjugated aldehydes react with thiols and 
amines of proteins, creating modified proteins and 
degradation products that are universally detectable 
in biologic systems. Thus, lipid peroxidation appears 
to be an ongoing process. These are measured as pro¬ 
tein carbonyls, another useful way to monitor oxidative 
stress in biologic systems [70]. 

Lipid peroxides are also formed enzymatically and 
are metabolized by several peroxidase systems. The 
most extensively studied enzymes are the glutathione 
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Environmental Toxicology: Oxidative Stress. Figure 4 

Non-radical oxidants in biologic systems. Difficulties 
in quantifying oxidants and their rates of generation, 
as well as complexities in adaptive and toxic responses 
to long-term oxidant exposure and a generally 
conservative approach to terminology, have resulted 
in widespread use of the term "reactive oxygen species" 
(ROS). Although convenient, this term does not 
discriminate between radical and non-radical species 
and carries a bias that relevant biologic oxidants 
are closely related to 0 2 . A large number of non-radical 
oxidants, some of which are listed in this figure, exist in 
the environment and in biologic systems. Ozone 
and singlet oxygen (triplet 0 2 is ground state) are common 
in the atmosphere, while H 2 0 2 , hypochlorous acid and 
lipid hydroperoxides are generated enzymatically and 
non-enzymatically in biologic systems. A number of 
other reactive carbon oxidation species occur, such as 
endoperoxides, epoxides and quinones [right). 

Reactive nitrogen oxidation products are also relevant 
biological oxidants. Nitrous acid and dinitrogen 
tetroxide are common in the atmosphere and also in 
biologic systems. Peroxynitrite is a reactive oxidant 
produced by reaction of two radicals, nitric oxide and 
superoxide. Nitric oxide (NO-) is a protective 
signaling molecule in the vascular system and is converted 
to peroxynitrite in the presence of superoxide. 
Alkylperoxynitrites can similarly form from nitric oxide 
and lipid peroxyl radicals. Other oxidants can also 
occur, such as disulfides of cysteine and other low 
molecular weight thiols. Other sulfur oxidation products, 
such as sulfenates and sulfinates, can also function as 
oxidants 


(GSH) peroxidases [74], but more recent data 
indicate that a thioredoxin-dependent peroxidase, 
peroxiredoxin-6, is more active in the elimination of 
lipid hydroperoxides [75, 76]. The lipid peroxides can 
be very important in lipid peroxidation by providing 
a mechanism for amplification of lipid peroxidation 
[60] . Trace metal ions, such as Fe 2+ or Cu 1+ , can donate 
an electron to the peroxide to form another radical 
initiation event. 

Because of these processes, radical chain reactions 
are almost completely prevented in living cells. In con¬ 
trast, oxidative processes are a component of cellular 
destruction mechanisms used to kill microorganisms, 
to establish physical barriers to prevent infection, and 
to degrade and remove dead and dying cells. Phago¬ 
cytes have a respiratory burst in which NADPH 
oxidase-2 generates ROS to kill microorganisms [67]. 
During apoptosis, a mitochondrial respiratory burst is 
created by the release of mitochondrial cytochrome 
c and switching the electron transfer chain to one that 
generates 0 2 ~ [77] . Thus, unlike the interpretation that 
continuous generation of biomarkers of oxidative 
stress indicates that 0 2 is a double-edged sword, the 
evidence shows that oxidative activities are part of 
normal biologic processes and that there is reserve 
capacity to protect against catastrophic collapse. 
Chain reactions of lipid peroxidation do not provide 
a major, central toxicologic process, but rather, when 
lipid peroxidation occurs, it mostly occurs by radical 
initiation events directly from environmental cause 
rather than as a consequence of chain reactions. 

In summary, radical mechanisms are important for 
environmental toxicology because environmental 
agents can directly cause initiation of radical reactions 
that damage all types of macromolecules. Methods are 
available to evaluate radical-dependent mechanisms of 
oxidative stress in biological materials, especially 
through measurement of isoprostanes, protein car¬ 
bonyls, and other well-documented oxidation prod¬ 
ucts. Low levels of these products are found in normal 
cells, and healthy organisms are well protected against 
these reactions. Based upon extensive double-blind 
interventional studies with free radical scavengers in 
humans, these reactions do not contribute significantly 
to long-term health even though many studies show 
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short-term reductions in biomarkers of oxidative 
stress. Thus, in the absence of high concentrations of 
acute environmental exposure, the radical mechanisms 
appear to be secondary to non-radical oxidative mech¬ 
anisms of toxicity [78]. 

Non-radical Mechanisms of Environmental 
Toxicity 

The principles of non-radical mechanisms of oxidative 
stress are outlined in detail in “Radical-free biology of 
oxidative stress” [78]. Research has provided extensive 
knowledge of radical chemistry and antioxidants stud¬ 
ied in purified systems and subcellular fractions of 
biologic systems, often with little regard to biologic 
relevance [79]. Most critically, biologic systems gener¬ 
ate more non-radical oxidants (Fig. 4) than radicals, 
and the effective radical scavenging processes largely 
convert radicals to non-radicals or otherwise limit their 
destructive potential [78]. Important non-radical oxi¬ 
dants (Fig. 4) include H 2 0 2 , lipid hydroperoxides, qui- 
nones, disulfides, and reactive nitrogen species (RNS), 
especially peroxynitrite [80]. Peroxynitrite is a major 
oxidant formed by reaction of superoxide with nitric 
oxide, a protective signaling molecule controlling 
microcirculation and immune function [80, 81]. 

In the xanthine oxidase system, which provided the 
discovery of superoxide dismutase, the predominant 
product is H 2 0 2 , not the radical superoxide anion 
[82]. Under optimal conditions for univalent flux to 
form superoxide, this represents only 30% of the total 
flux, i.e., under all conditions divalent flux to form 
H 2 0 2 predominated [82]. The univalent flux was con¬ 
siderably less under most conditions, consistent with 
partitioning of radical and non-radical oxidant pro¬ 
duction as depicted in Fig. 5. Many redox proteins 
that generate superoxide show rapid, sequential univa¬ 
lent transfer so that H 2 0 2 is a common product of these 
reactions. Furthermore, dismutation of superoxide 
anion to H 2 0 2 and 0 2 occurs at a rapid rate (46) and, 
as indicated above, reaction of radicals with radical 
scavengers results in efficient conversion to non-radical 
oxidants. Consequently, the 2-e _ , non-radical oxidant, 
H 2 0 2 , is produced at a higher rate and appears to 
represent the major oxidant burden. This rate has 
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Environmental Toxicology: Oxidative Stress. Figure 5 

Non-radical mechanisms of oxidative stress are 
quantitatively important in chronic and delayed toxicities. 
Although toxicity due to radical-induced macromolecular 
damage occurs under some conditions of acute toxicity, 
accumulating evidence indicates that non-radical 
mechanisms are important under other conditions. In this 
figure, the total rate of 0 2 reduction to generate reactive 
oxygen species is expressed as 100%. Two-electron 
transfers predominate at biologic active sites that reduce 
0 2 to superoxide and H 2 0 2 , so that radicals are always 
produced at lower rates (e.g., <10% of total) than 2- 
electron products, such as H 2 0 2 (>90% of total). Radical 
scavengers, including superoxide dismutase, are highly 
efficient (e.g., >99%) so that the radicals are largely 
converted to non-radical oxidants. This results in a low 
level of radical-mediated macromolecular damage and 
oxidative stress predominantly caused by non-radical 
oxidants. Non-radical oxidants preferentially target 
reactive thiols of redox signaling and control pathways, 
resulting in a disruption of signaling and control. 
Disruption of signaling and control mechanisms result in 
long-term dysfunction and reproductive and 
developmental toxicities. These can be insidious and pose 
a risk for environmental sustainability 

been estimated to be 1-4% of the rate of 0 2 consump¬ 
tion in mammalian systems [63], but rates can 
be higher in some compartments like the peroxisomes 
[83], or lower, such as in the cytoplasm or nuclei [84]. 

With non-radical oxidants being quantitatively 
most important and radical scavenging antioxidants 
converting some of the radicals to non-radical 
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oxidants, it is not surprising that radical-scavenging 
antioxidants do not provide protection against disease 
[40-47]. Perhaps a better way to view this in terms of 
environmental toxicity is that radical mechanisms are 
likely to contribute to acute toxicities but not as likely 
to contribute to chronic toxicities. In these reactions, 
macromolecular damage is of major concern, 
and all classes of macromolecules are potentially vul¬ 
nerable. On the other hand, non-radical mechanisms 
can contribute to both; as described below, these 
are most relevant to chronic toxicity and can 
have effects through disruption of redox signaling and 
control, whether or not macromolecular damage 
occurs. 

The major mechanisms of non-radical oxidative 
stress involve effects on thiols in proteins [78], 
although mutagenic damage to DNA also occurs [11, 
85, 86]. Non-radical oxidants can interfere with revers¬ 
ible oxidation-reduction reactions of thiols by causing 
abnormal oxidation or irreversible modification. These 
changes alter physiological function in receptor signal¬ 
ing, transcriptional regulation, cell proliferation, 
angiogenesis, and apoptosis. Whether or not macro¬ 
molecular damage occurs, the disruption of these func¬ 
tions alters cell fate and impacts physiologic health. 
Such changes can be adaptive and protective, in the 
process known as hormesis [87], but also can cause 
maladaptation and disease. These processes can occur 
due to a number of specific environmental agents, can 
occur in combination, and can be insidious in their 
long-term effects. 

These concepts have been formalized in the redox 
hypothesis of oxidative stress [78], which provides four 
postulates to guide investigation into detailed mecha¬ 
nisms, ways to detect oxidative stress, and strategies to 
intervene to avoid or minimize oxidative stress. These 
postulates point to a need to incorporate systems biol¬ 
ogy, modern -omic technologies and bioinformatics into 
environmental health research, specifically to address 
environmental toxicities. These four postulates are: 

1. All biologic systems contain redox elements (e.g., 
redox-sensitive cysteine, Cys, residues) that func¬ 
tion in cell signaling, macromolecular trafficking, 
and physiologic regulation. 

2. Organization and coordination of the redox activity 
of these elements occurs through redox circuits 


dependent upon common control nodes (e.g., 
thioredoxin, GSH). 

3. The redox-sensitive elements are spatially and 
kinetically insulated so that “gated” redox circuits 
can be activated by translocation/aggregation and/ 
or catalytic mechanisms. 

4. Oxidative stress is a disruption of the function of 
these redox circuits caused by specific reaction with 
the redox-sensitive thiol elements, altered pathways 
of electron transfer, or interruption of the gating 
mechanisms controlling the flux through these 
pathways. 

Proteins contain three functional groups that 
undergo reversible oxidation, the thiol in cysteine 
(Cys), the thioether in methionine (Met), and the 
selenol in selenocysteine (Sec). Cys has been studied 
most extensively, with relevant oxidation states includ¬ 
ing the thiol (-SH), disulfide (-SS-), sulfenate (-SO - ), 
sulfinate (-S0 2 ~), and sulfonate (-S0 3 - ). Thiyl radi¬ 
cals (-RS-) generated from thiols in the presence of 
oxygen-centered radicals [88], as well as other reactive 
sulfur species [89], can also be considered as toxic 
species in oxidative stress. Thiyl radicals rapidly react 
to form disulfides [90], and this may serve as 
a convergence point between 1-e - and 2-e - events. 
Relatively unstable sulfenates are converted to 
disulfides in the presence of thiols, and in some protein 
structures are stabilized as sulfenamides [91, 92]. 
Higher oxidation states, sulfinates and sulfonates, are 
typically not reversible in mammalian systems. A yeast 
enzyme (sulfiredoxin) reduces sulfinate in 
peroxiredoxin [93] and could be important in redox 
signaling mechanisms [94, 95]. 

Reversible oxidations of Met residues [96, 97] and 
the less common amino acid, selenocysteine (Sec) 
[98, 99], can also be important in toxicologic mecha¬ 
nisms. Met oxidation to methionine sulfoxide occurs in 
oxidative stress and aging [97, 100]. A good example, 
studied extensively in association with cigarette 
smoking, is the increased lung rigidity associated with 
loss of a 1-antitrypsin inhibitor activity upon Met oxi¬ 
dation [ 101 ]. The loss of inhibition of elastase results in 
damage to lung structures and contributes to obstruc¬ 
tive lung disease. Additional research shows that two 
types of methionine sulfoxide reductases (MSR) are 
important in protecting against distinct S- and 
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R-sulfoxides in methionine residues [96, 102, 103]. 
These MSR activities are dependent upon thioredoxins 
[103] and are associated with longevity [ 104-107] . This 
less frequently studied type of oxidation involves both 
Met and Sec, and probably deserves much greater 
attention in environmental oxidative stress due to the 
variable environmental availability of Met and Sec. The 
selenol of Sec undergoes reversible oxidation- 
reduction during catalytic functions of thioredoxin 
reductases and Se-dependent glutathione peroxidases 
[98, 99], enzymes present at key positions in both 
thioredoxin and GSH pathways and critically impor¬ 
tant to protect against non-radical oxidative stress. 

Key points of oxidative stress according to the redox 
hypothesis include recognition of mechanisms of oxi¬ 
dative stress other than an imbalance of prooxidants 
and antioxidants. One of these involves altered func¬ 
tion of redox circuits caused by specific reaction with 
redox-sensitive thiol elements (Fig. 6). This differs 
from the earlier concepts of macromolecular damage 
in that it could affect adaptability or tolerance of an 
organism without overt toxicity. For instance, redox¬ 
sensing thiols could govern tolerance to cold or 
prolonged starvation. Modification of these thiols 
would have no effect without the environmental chal¬ 
lenge but would have catastrophic outcome with cold 
exposure or starvation. Environmental exposures 
could also create new pathways for electron transfer, 
effectively creating short circuits. Such short circuits 
may be accommodated by the stability of a network 
structure but create an overall vulnerability to other 
exposures. 

Finally, a redox system design using low-flux sys¬ 
tems to control high-flux systems is economical in 
terms of energy utilization [64]. Such a structure can 
be highly sensitive to low level exposures because it 
allows toxic agents to interrupt the gating mechanisms 
that control flux through these pathways. Because of 
this, accumulation of low levels of certain agents may 
induce a loss of function. 

Future Directions 

Oxidative stress and environmental sustainability. The 
view of oxidative stress as a disruption of redox 
signaling and control, as opposed to an imbalance 
of prooxidants and antioxidants, has different 


implications for sustainability in the environment. If 
oxidative stress were simply an imbalance, then oxida¬ 
tive stress caused by environmental agents could be 
eliminated by removing the source or directly compen¬ 
sated by increased antioxidants. Cleanup efforts gener¬ 
ally approach the problem as though eliminating the 
source were sufficient. However, this simplistic 
approach, like the antioxidant supplementation trials, 
does not effectively address the problem with regard to 
environmental sustainability. 

A critical issue is that complex systems respond to 
challenges with adaptation to maintain function. 
Because of this, subtoxic exposures can accumulate 
without overt signs of adverse effects. In the long 
term, this is not sustainable. Consequently, studies to 
evaluate impact can provide reassuring evidence that 
specific environmental challenges are not a problem, 
when the data may really mean that the biologic sys¬ 
tems are effectively engineered to conceal hazards. 
While this may seem overly paranoid, millions of new 
chemicals have been synthesized in research efforts, 
and tens of thousands are in commercial use. Conse¬ 
quently, to consider sustainability in terms of oxidative 
stress, one must evaluate sustainability in terms of the 
impacts of cumulative exposures and acknowledge the 
reality that one cannot correct oxidative stress simply 
by shifting the balance with antioxidants. 

Unfortunately, the scientific method is not well 
designed to deal with complex systems. The range of 
responses to multiple exposures within complex sys¬ 
tems over long periods of exposure cannot be evaluated 
or predicted accurately with contemporary methods. 
Sustainability in the environment necessitates new 
ways to evaluate exposures and biologic responses. 

From the standpoint of oxidative stress, a redox 
proteomics approach provides means for systems anal¬ 
ysis. With a currently available liquid chromatography- 
mass spectrometry approach, one can measure the 
percent oxidation of specific cysteine residues in hun¬ 
dreds of proteins [108]. These Cys residues represent 
only a fraction of the total Cys encoded by the genome, 
which is about 200,000 in the 18,000 proteins in 
humans, but none-the-less provide an indication of 
the steady-state oxidation of the individual. The Cys 
residues change in oxidation state in response to expo¬ 
sures so that one could, in principle, establish sentinel 
species to examine environmental oxidative stress. 
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Environmental Toxicology: Oxidative Stress. Figure 6 

Targets of oxidative stress include redox-sensitive thiols functioning in subcellular compartments. The human genome 
encodes 214,000 different cysteine residues. The resulting cysteine proteome is distributed among cellular compartments, 
with extracellular protein thiols (indicated as nodes) interacting with free cysteine and cystine, designated as a CySS redox- 
control hub, in the plasma and interstitial space. Solid connectors (edges) indicate direct interactions in a network 
structure. On the alveolar surface of the lung and lining of the gastrointestinal tract, GSH and GSSG appear to serve as the 
predominant redox-control hub. The extracellular thiol/disulfide systems are connected to intracellular systems 
(designated by broken lines) principally through transport systems or other indirect mechanisms. Within the cytoplasm, 
a CySS hub and a H 2 0 2 hub function in the oxidation of cytoplasmic proteins. These oxidative reactions are ongoing and 
are countered by two major NADPH-dependent thiol reduction systems, represented by thioredoxin-1 (Trxl) and GSH/ 
GSSG hubs. The relative rates for different proteins vary, with the majority of Cys residues being largely reduced under 
steady-state conditions. Trxl is translocated into nuclei during oxidative stress; in the nuclei, Trxl is maintained at a more 
reduced state than in the cytoplasm. Both nuclear Trxl and nuclear GSSG hubs function are at least partially independent 
of respective cytoplasmic hubs. These hubs are important for redox control of transcription and are, therefore, critical 
nuclear targets of oxidative stress. Mitochondria contain Trx2 and GSH/GSSG hubs that control redox states of 
mitochondrial proteins. Lipophilic cationic oxidants can be especially toxic because they accumulate in mitochondria and 
disrupt bioenergetic control. In the secretory pathway, protein disulfide isomerase (PDI) is oxidized by an endoplasmic 
reticulum oxidase system (EROS) and functions as a redox hub controlling Cys oxidation in proteins destined for the 
plasma membrane or extracellular compartments. The range of organelle function in specialized tissues results in organ 
system-specific oxidative stress. 
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Two types of data can be obtained, one giving maps 
of electron transfer circuits and the other giving func¬ 
tional pathway maps linked by Cys oxidation [108]. 
Such a biomonitoring approach could be used to estab¬ 
lish cumulative databases, providing a strategy to 
quantify deterioration of environmental conditions in 
terms of disruption of protein redox systems in the 
sentinel species. Of course, humans could be one of 
the sentinel species, and evaluation of protein oxida¬ 
tion in human samples could be useful to document 
oxidative stress. Similar information could be obtained 
from gene expression profiling; complete genomic 
information is available now for several species, so 
that multiple options are available to test the feasibility 
of such approaches. 

Unfortunately, changes in redox proteomics or gene 
expression only reveal whether deterioration has 
occurred, and do not provide insight into the underly¬ 
ing causes. Exposure information will be needed to link 
environmental deterioration to specific causes. Current 
environmental surveillance methods are poorly 
equipped to address this problem because analyses are 
highly targeted to a relatively small number of hazard¬ 
ous agents. Newer methods are available to profile 
chemicals in biologic extracts, which show that thou¬ 
sands of chemicals can be detected in a single analysis 
[33] . Less than half of the chemicals detected in human 
plasma have been identified, meaning that humans are 
exposed to a large number of unidentified chemicals. 
Some of these may be environmental degradation 
products of known hazardous and nonhazardous 
chemicals. Whether these chemicals are helpful or 
harmful is not known, but multivariate analysis and 
modern bioinformatic methods can support studies 
linking alteration in metabolic characteristics with 
abundance of specific chemicals in biologic samples. 
Clearly, investment in new methods as well as cumula¬ 
tive environmental health data registries will be needed 
to address critical questions of environmental sustain¬ 
ability with regard to oxidative stress. 

During the past decade, methods have been devel¬ 
oped to allow study of oxidative stress within specific 
subcellular compartments [84, 109]. Mammalian cells 
contain about 214,000 Cys residues encoded in the 
genome, and redox biochemical approaches are begin¬ 
ning to provide an understanding of the organization 
of the redox network structure [108, 110, 111]. This 


research has revealed that oxidative stress is not uni¬ 
form within cells but rather selectively affects specific 
pathways in compartments. The extracellular compart¬ 
ments of plasma and interstitial space are generally 
more oxidized than cellular compartments (Fig. 6) 
[84], have fewer antioxidant systems, and are vulnera¬ 
ble to oxidants. The lung alveoli, oral mucosa, and 
intestinal lumen are supplied with the thiol antioxidant 
GSH, and have protective enzymes associated with the 
mucous-lining layer. Thus, the maintenance of these 
surface barriers is critical for the protection of an 
organism because the lung lining is first exposed to 
airborne toxicants, while the intestine is first exposed 
to ingested toxicants. 

Within cells, different redox systems function in the 
organelles. The endoplasmic reticulum (ER) and secre¬ 
tory pathway use protein disulfide isomerase (PDI) and 
an oxidase system (EROS) to oxidize proteins during 
processing for secretion ([112]; Fig. 6). Disruption of 
this oxidative pathway activates cell death through an 
ER stress mechanism [112,113]. Nuclei and mitochon¬ 
dria are more reduced [84], and each has specific 
redox-sensitive proteins. Mitochondria have a unique 
thioredoxin-2, Trx2, and glutaredoxin-2, Grx2, while 
thioredoxin-1 and glutaredoxin-1 are found in the 
cytoplasm. Trxl is translocated into nuclei during oxi¬ 
dative stress [114], and nuclei also contain Grx2, at 
least when over-expressed in cells [110]. Accumulating 
evidence indicates that most agents that cause oxidative 
stress do so by disrupting specific redox control path¬ 
ways associated with specific organelles (Fig. 6). 

Finally, there is a possibility that artificial intelli¬ 
gence may provide a more cost-effective means to 
monitor environmental sustainability. Irregularity of 
biologic data often reflects adaptability and health of 
the system. In essence, the accommodation of oxidative 
challenges can be expected to decrease the flexibility of 
the system. Thus, if appropriate methods were devel¬ 
oped, it may be possible to evaluate health (or sustain¬ 
ability of a system) in terms of irregularity of metabolic 
or protein data [115]. Such an approach could poten¬ 
tially be accomplished using monofractality or 
multifractality, such as expressed by Hurst or Holder 
exponents [116]. While these are not traditional 
approaches used by experimental or environmental 
biologists, they could be very useful to approach 
the rather difficult problem of environmental 
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sustainability in the face of such a large number of 
newly introduced and accumulating abundance of 
environmental agents. 

In summary, oxidative stress is an important con¬ 
cern in environmental toxicology, both as a cause of 
acute toxicity and as a contributor to chronic toxicity. 
In terms of sustainability, agents causing acute toxicity 
due to oxidative stress are more readily identified and 
managed by containment and cleanup, in some cases 
complemented by use of metal chelators and free rad¬ 
ical scavenging antioxidants. Agents that cause chronic 
toxicity or delayed reproductive or developmental tox¬ 
icity are of greater concern because of the difficulty of 
identification and association with the toxicity. At pre¬ 
sent, there is no way to evaluate whether this is a minor 
or major problem, but halogenated hydrocarbons are 
commonly used. Some of these could accumulate in the 
biosphere, and most of these are likely to contribute to 
oxidative stress. Safety testing mostly is performed only 
on commercial materials and not on products formed 
by the modification of these materials after release into 
the environment. Mass spectrometry analyses of 
human plasma reveal thousands of unidentified ions 
with unknown biologic effects. Currently there is no 
environmental exposure census to reveal whether or 
not such chemicals are accumulating due to human 
activity. Thus, there is considerable opportunity and 
need to enhance understanding of the environmental 
load of chemicals that could accumulate and cause 
adverse effects due to oxidative stress [117-122]. 
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